Search results for: RDF DATASET PROFILING

How Specific Can We Be with k-NN Classifier?

Publication

- Year 2014

This paper discusses the possibility of designing a two stage classifier for large-scale hierarchical and multilabel text classification task, that will be a compromise between two common approaches to this task. First of it is called big-bang, where there is only one classifier that aims to do all the job at once. Top-down approach is the second popular option, in which at each node of categories’ hierarchy, there is a flat classifier...

Full text to download in external service

Hasse diagram as a green analytical metrics tool: ranking of methods for benzo[a]pyrene determination in sediments

Publication

P. Bigus
S. Tsakovski
V. Simeonov
J. Namieśnik
M. Tobiszewski

- ANALYTICAL AND BIOANALYTICAL CHEMISTRY - Year 2016

This study presents an application of the Hasse diagram technique (HDT) as the assessment tool to select the most appropriate analytical procedures according to their greenness or the best analytical performance. The dataset consists of analytical procedures for benzo[a]pyrene determination in sediment samples, which were described by 11 variables concerning their greenness and analytical performance. Two analyses with the HDT...

Full text available to download

Residual MobileNets

Publication

- Year 2019

As modern convolutional neural networks become increasingly deeper, they also become slower and require high computational resources beyond the capabilities of many mobile and embedded platforms. To address this challenge, much of the recent research has focused on reducing the model size and computational complexity. In this paper, we propose a novel residual depth-separable convolution block, which is an improvement of the basic...

Full text to download in external service

Expectation-Maximization Model for Substitution of Missing Values Characterizing Greenness of Organic Solvents

Publication

- MOLECULES - Year 2018

Organic solvents are ubiquitous in chemical laboratories and the Green Chemistry trend forces their detailed assessments in terms of greenness. Unfortunately, some of them are not fully characterized, especially in terms of toxicological endpoints that are time consuming and expensive to be determined. Missing values in the datasets are serious obstacles, as they prevent the full greenness characterization of chemicals. A featured...

Full text available to download

Analysis of results of large-scale multimodal biometric identity verification experiment

Publication

- IET Biometrics - Year 2018

An analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...

Full text available to download

Occurrence of Cyanobacteria in the Gulf of Gdańsk (2008–2009)

Publication

- Year 2022

Blooms of cyanobacteria develop each summer in the Baltic Sea. Collecting complete data on this phenomenon is helpful in understanding the changes taking place in the Baltic Sea and forecasting the occurrence of these phenomena in the future. This dataset includes unpublished information about the occurrence of cyanobacteria in the Gulf of Gdańsk (Southern Baltic) in 2008 and 2009. The presented data combines basic physic-ochemical...

Full text available to download

Photos and rendered images of LEGO bricks

Publication

T. M. Boiński

- Scientific Data - Year 2023

The paper describes a collection of datasets containing both LEGO brick renders and real photos. The datasets contain around 155,000 photos and nearly 1,500,000 renders. The renders aim to simulate real-life photos of LEGO bricks allowing faster creation of extensive datasets. The datasets are publicly available via the Gdansk University of Technology “Most Wiedzy” institutional repository. The source files of all tools used during...

Full text available to download

Preeclampsia Risk Prediction Using Machine Learning Methods Trained on Synthetic Data

Publication

M. Mazur-Milecka
N. Kowalczyk
K. Jaguszewska
D. Zamkowska
D. Wójcik
K. Preis
H. Skov
S. R. Wagner
P. Sandager
M. Sobotka
J. Rumiński

- Year 2024

This paper describes a research study that investigates the use of machine learning algorithms on synthetic data to classify the risk of developing preeclampsia by pregnant women. Synthetic datasets were generated based on parameter distributions from three real patient studies. Four models were compared: XGBoost, Support Vector Machine (SVM), Random Forest, and Explainable Boosting Machines (EBM). The study found that the XGBoost...

Full text to download in external service

LSA Is not Dead: Improving Results of Domain-Specific Information Retrieval System Using Stack Overflow Questions Tags

Publication

S. Olewniczak
J. Szymański
P. Malak
R. Komar
A. Letowska

- Year 2024

The paper presents the approach to using tags from Stack Overflow questions as a data source in the process of building domain-specific unsupervised term embeddings. Using a huge dataset of Stack Overflow posts, our solution employs the LSA algorithm to learn latent representations of information technology terms. The paper also presents the Teamy.ai system, currently developed by Scalac company, which serves as a platform that...

Full text available to download

The Belt and Road Initiative and export variety: 1996–2019

Publication

- Asian-Pacific Economic Literature - Year 2024

This study examines the association between the Belt and Road Initiative (BRI) and export variety (EV). We propose three hypotheses on how BRI may foster export markets (destinations) or export product lines. The estimates are based on a dataset constructed specifically for this analysis, covering 183 countries and linked with trade data from 1996 to 2019. We apply the instrumental variable (IV) approach in regressions for covering the...

Full text to download in external service

Instance segmentation of stack composed of unknown objects

Publication

- ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Year 2023

The article reviews neural network architectures designed for the segmentation task. It focuses mainly on instance segmentation of stacked objects. The main assumption is that segmentation is based on a color image with an additional depth layer. The paper also introduces the Stacked Bricks Dataset based on three cameras: RealSense L515, ZED2, and a synthetic one. Selected architectures: DeepLab, Mask RCNN, DEtection TRansformer,...

Full text available to download

A Bayesian regularization-backpropagation neural network model for peeling computations

Publication

S. Gouravaraju
J. Narayan
R. Sauer
S. S. Gautam

- JOURNAL OF ADHESION - Year 2023

A Bayesian regularization-backpropagation neural network (BRBPNN) model is employed to predict some aspects of the gecko spatula peeling, viz. the variation of the maximum normal and tangential pull-off forces and the resultant force angle at detachment with the peeling angle. K-fold cross validation is used to improve the effectiveness of the model. The input data is taken from finite element (FE) peeling results. The neural network...

Full text available to download

Forecasting risks and challenges of digital innovations

Publication

M. Sikorski

- Year 2020

Forecasting and assessment of societal risks related to digital innovation systems and services is an urgent problem, because these solutions usually contain artificial intelligence algorithms which learn using data from the environment and modify their behaviour much beyond human control. Digital innovation solutions are increasingly deployed in transport, business and administrative domains, and therefore, if abused by a malicious...

Full text to download in external service

Development of cluster analysis methodology for identification of model rainfall hyetographs and its application at an urban precipitation field scale

Publication

K. Mikołajewski
M. Ruman
K. Kosek
M. Glixelli
P. Dzimińska
P. Ziętara
P. Licznar

- SCIENCE OF THE TOTAL ENVIRONMENT - Year 2022

Despite growing access to precipitation time series records at a high temporal scale, in hydrology, and particularly urban hydrology, engineers still design and model drainage systems using scenarios of rainfall temporal distributions predefined by means of model hyetographs. This creates the need for the availability of credible statistical methods for the development and verification of already locally applied model hyetographs....

Full text available to download

Vehicle detector training with minimal supervision

Publication

- Year 2019

Recently many efficient object detectors based on convolutional neural networks (CNN) have been developed and they achieved impressive performance on many computer vision tasks. However, in order to achieve practical results, CNNs require really large annotated datasets for training. While many such databases are available, many of them can only be used for research purposes. Also some problems exist where such datasets are not...

Automatic Threat Detection for Historic Buildings in Dark Places Based on the Modified OptD Method

Publication

W. Błaszczak-bąk
C. Suchocki
J. Janicka
A. Dumalski
R. Duchnowski
A. Sobieraj-Żłobińska

- ISPRS International Journal of Geo-Information - Year 2020

Historic buildings, due to their architectural, cultural, and historical value, are the subject of preservation and conservatory works. Such operations are preceded by an inventory of the object. One of the tools that can be applied for such purposes is Light Detection and Ranging (LiDAR). This technology provides information about the position, reflection, and intensity values of individual points; thus, it allows for the creation...

Full text available to download

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Publication

- Year 2016

Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text to download in external service

Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction

Publication

- Year 2016

Unorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...

Full text to download in external service

INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY

Publication

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2018

In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...

CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image

Publication

P. Szczuko

- Year 2018

The paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about...

Full text to download in external service

Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora

Publication

- Year 2018

Methodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...

Full text available to download

Visual Content Learning in a Cognitive Vision Platform for Hazard Control (CVP-HC)

Publication

C. Silva de Oliveira
C. Sanin
E. Szczerbicki

- CYBERNETICS AND SYSTEMS - Year 2019

This work is part of an effort for the development of a Cognitive Vision Platform for Hazard Control (CVP-HC) for applications in industrial workplaces, adaptable to a wide range of environments. The paper focuses on hazards resulted from the nonuse of personal protective equipment (PPE). Given the results of previous analysis of supervised techniques for the problem of classification of a few PPE (boots, hard hats, and gloves...

Full text available to download

Crowdsourcing-Based Evaluation of Automatic References Between WordNet and Wikipedia

Publication

- INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING - Year 2019

The paper presents an approach to build references (also called mappings) between WordNet and Wikipedia. We propose four algorithms used for automatic construction of the references. Then, based on an aggregation algorithm, we produce an initial set of mappings that has been evaluated in a cooperative way. For that purpose, we implement a system for the distribution of evaluation tasks, that have been solved by the user community....

Full text available to download

Analysis of the Capability of Deep Learning Algorithms for EEG-based Brain-Computer Interface Implementation

Publication

- Year 2023

Machine learning models have received significant attention for their exceptional performance in classifying electroencephalography (EEG) data. They have proven to be highly effective in extracting intricate patterns and features from the raw signal data, thereby contributing to their success in EEG classification tasks. In this study, we explore the possibilities of utilizing contemporary machine learning algorithms in decoding...

Full text to download in external service

Driver’s Condition Detection System Using Multimodal Imaging and Machine Learning Algorithms

Publication

- Year 2023

To this day, driver fatigue remains one of the most significant causes of road accidents. In this paper, a novel way of detecting and monitoring a driver’s physical state has been proposed. The goal of the system was to make use of multimodal imaging from RGB and thermal cameras working simultaneously to monitor the driver’s current condition. A custom dataset was created consisting of thermal and RGB video samples. Acquired data...

Full text to download in external service

A novel approach exploiting properties of convolutional neural networks for vessel movement anomaly detection and classification

Publication

- ISA TRANSACTIONS - Year 2022

The article concerns the automation of vessel movement anomaly detection for maritime and coastal traffic safety services. Deep Learning techniques, specifically Convolutional Neural Networks (CNNs), were used to solve this problem. Three variants of the datasets, containing samples of vessel traffic routes in relation to the prohibited area in the form of a grayscale image, were generated. 1458 convolutional neural networks with...

Full text available to download

Focus on Misinformation: Improving Medical Experts’ Efficiency of Misinformation Detection

Publication

A. Nabożny
B. Balcerzak
M. Morzy
A. Wierzbicki

- Year 2021

Fighting medical disinformation in the era of the global pandemic is an increasingly important problem. As of today, automatic systems for assessing the credibility of medical information do not offer sufficient precision to be used without human supervision, and the involvement of medical expert annotators is required. Thus, our work aims to optimize the utilization of medical experts’ time. We use the dataset of sentences taken...

Full text to download in external service

Global value chains and wages under different wage setting mechanisms

Publication

- Competition & Change. The Journal of Global Business and Political Economy - Year 2023

This study examines whether, and how, differences in wage bargaining schemes shape the relationship between global value chains (GVCs) and the wages of workers while considering both GVC participation and position in GVC. Our dataset is derived from the European Structure of Earnings Survey (SES), containing employee–employer data from 18 European countries, merged with sectoral data from the World Input-Output Database (WIOD)....

Full text available to download

Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

Publication

B. Kostek

- Year 2022

In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Full text available to download

Multi-task Video Enhancement for Dental Interventions

Publication

- Year 2022

A microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular,...

Full text to download in external service

A Triplet-Learnt Coarse-to-Fine Reranking for Vehicle Re-identification

Publication

E. Katsaros

- Year 2020

Vehicle re-identification refers to the task of matching the same query vehicle across non-overlapping cameras and diverse viewpoints. Research interest on the field emerged with intelligent transportation systems and the necessity for public security maintenance. Compared to person, vehicle re-identification is more intricate, facing the challenges of lower intra-class and higher inter-class similarities. Motivated by deep...

Full text to download in external service

Methods for quality improvement of multibeam and LiDAR point cloud data in the context of 3D surface reconstruction

Publication

- HYDROACOUSTICS - Year 2016

Point cloud dataset is the transitional data model used in several marine and land remote-sensing applications. During further steps of processing, the transformation of point cloud spatial data to more complex models containing higher order geometric structures like edges and facets may be possible, if an appropriate quality level of input data is provided. Point cloud datasets usually contain a considerable amount of undesirable...

Full text available to download

Vehicle detector training with labels derived from background subtraction algorithms in video surveillance

Publication

- Year 2018

Vehicle detection in video from a miniature station- ary closed-circuit television (CCTV) camera is discussed in the paper. The camera provides one of components of the intelligent road sign developed in the project concerning the traffic control with the use of autonomous devices being developed. Modern Convolutional Neural Network (CNN) based detectors need big data input, usually demanding their manual labeling. In the presented...

Improving Traffic Light Recognition Methods using Shifting Time-Windows

Publication

A. Blokus
H. Krawczyk

- Year 2018

We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...

Full text to download in external service

Improving methods for detecting people in video recordings using shifting time-windows

Publication

A. Blokus
H. Krawczyk

- Year 2018

We propose a novel method for improving algorithms which detect the presence of people in video sequences. Our focus is on algorithms for applications which require reporting and analyzing all scenes with detected people in long recordings. Therefore one of the target qualities of the classification result is its stability, understood as a low number of invalid scene boundaries. Many existing methods process images in the recording...

Full text to download in external service

Areas of Updraft Air Motion in an Idealised Weather Research and Forecasting Model Simulation of Atmospheric Boundary Layer Response to Different Floe Size Distributions

Publication

M. Wenta

- Year 2022

Presented dataset is part of a numerical modelling study focusing on the analysis of the influence of sea ice floe size distribution (FSD) on the horizontal and vertical structure of convection in the atmosphere. The total area and spatial arrangement of the up-drafts indicates that the FSD affects the total moisture content and the values of area averaged turbulent fluxes in the model domain. In fact, while convective updrafts...

Full text available to download

Exploring Relationships Between Data in Enterprise Information Systems by Analysis of Log Contents

Publication

Ł. Korzeniowski
K. Goczyła

- Year 2024

Enterprise systems are inherently complex and maintaining their full, up-to-date overview poses a serious challenge to the enterprise architects’ teams. This problem encourages the search for automated means of discovering knowledge about such systems. An important aspect of this knowledge is understanding the data that are processed by applications and their relationships. In our previous work, we used application logs of an enterprise...

Full text to download in external service

Optimized Computational Intelligence Model for Estimating the Flexural Behavior of Composite Shear Walls

Publication

M. Mirrashid
H. Naderpour
D. N. Kontoni
A. Jakubczyk-Gałczyńska
R. Jankowski
T. N. Nguyen

- Buildings - Year 2023

This article presents a novel approach to estimate the flexural capacity of reinforced concrete-filled composite plate shear walls using an optimized computational intelligence model. The proposed model was developed and validated based on 47 laboratory data points and the Transit Search (TS) optimization algorithm. Using 80% of the experimental dataset, the optimized model was selected by determining the unknown coefficients of...

Full text available to download

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publication

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Year 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available to download

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download

Pursuing the Deep-Learning-Based Classification of Exposed and Imagined Colors from EEG

Publication

A. A. Torres-García
J. S. Garcia Salinas
L. Villaseñor-Pineda

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2022

EEG-based brain-computer interfaces are systems aiming to integrate disabled people into their environments. Nevertheless, their control could not be intuitive or depend on an active external stimulator to generate the responses for interacting with it. Targeting the second issue, a novel paradigm is explored in this paper, which depends on a passive stimulus by measuring the EEG responses of a subject to the primary colors (red,...

Full text to download in external service

Photoplethysmographic Time-Domain Heart Rate Measurement Algorithm for Resource-Constrained Wearable Devices and its Implementation

Publication

- SENSORS - Year 2020

This paper presents an algorithm for the measurement of the human heart rate, using photoplethysmography (PPG), i.e., the detection of the light at the skin surface. The signal from the PPG sensor is processed in time-domain; the peaks in the preprocessed and conditioned PPG waveform are detected by using a peak detection algorithm to find the heart rate in real time. Apart from the PPG sensor, the accelerometer is also used to...

Full text available to download

Investigating Feature Spaces for Isolated Word Recognition

Publication

P. Treigys
G. Korvel
G. Tamulevicius
J. Bernataviciene
B. Kostek

- Year 2020

The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...

Full text to download in external service

Intracranial electrophysiological recordings from the human brain during memory tasks with pupillometry

Publication

J. Cimbalnik
J. Dolezal
C. Topcu
M. Lech
V. Marks
B. Joseph
M. Dobias
J. Van Gompel
G. Worrell
M. T. Kucewicz

- Scientific Data - Year 2022

Data comprise intracranial EEG (iEEG) brain activity represented by stereo EEG (sEEG) signals, recorded from over 100 electrode channels implanted in any one patient across various brain regions. The iEEG signals were recorded in epilepsy patients (N=10) undergoing invasive monitoring and localization of seizures when they were performing a battery of four memory tasks lasting approx. 1 hour in total. Gaze tracking on the task...

Full text available to download

Simulations of the Derecho Event in Poland of 11th August 2017 Using WRF Model

Publication

- Year 2022

This series contains datasets related to the forecasting of a severe weather event, a derecho, in Poland on 11 August 2017. The simulations were conducted using the Weather Research and Forecasting (WRF) model version 4.2.1 with different initial and boundary conditions of the pressure and model levels derived from 5 global models: Global Forecast System (GFS), Global Data Assimilation System (GDAS), European Centre for Medium-Range...

Full text available to download

Changes in gene methylation patterns in neonatal murine hearts: Implications for the regenerative potential

Publication

- BMC GENOMICS - Year 2016

Background The neonatal murine heart is able to regenerate after severe injury; this capacity however, quickly diminishes and it is lost within the first week of life. DNA methylation is an epigenetic mechanism which plays a crucial role in development and gene expression regulation. Under investigation here are the changes in DNA methylation and gene expression patterns which accompany the loss of regenerative potential. Results The...

Full text available to download

Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage

Publication

A. Kołakowska
A. Landowska
P. Jarmolkowicz
M. Jarmolkowicz
K. Sobota

- Internet Research - Year 2016

Purpose The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...

Full text to download in external service

Comparative Analysis of Metabolic Variations, Antioxidant Profiles and Antimicrobial Activity of Salvia hispanica (Chia) Seed, Sprout, Leaf, Flower, Root and Herb Extracts

Publication

S. Motyka
B. Kusznierewicz
H. Ekiert
I. Korona-Głowniak
A. Szopa

- MOLECULES - Year 2023

The purpose of this study was to evaluate the phytochemical profiles of the seeds, sprouts, leaves, flowers, roots and herb of Salvia hispanica and to demonstrate their significant contribution to antioxidant and antimicrobial activities. Applied methods were: HPLC-DAD coupled with post-column derivatization with ABTS reagent, untargeted metabolomics performed by LC-Q-Orbitrap HRMS, and two-fold micro-dilution broth method, which...

Full text available to download

Predicting sulfanilamide solubility in the binary mixtures using a reference solvent approach

Publication

P. Cysewski
M. Przybyłek
T. Jeliński

- Polimery w Medycynie - Year 2024

Background. Solubility is a fundamental physicochemical property of active pharmaceutical ingredients. The optimization of a dissolution medium aims not only to increase solubility and other aspects are to be included such as environmental impact, toxicity degree, availability, and costs. Obtaining comprehensive...

Full text available to download

Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation

Publication

S. Raczyński
E. Vincent

- Year 2014

In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic...

Full text to download in external service

Search

Filters

Catalog

Category

Year

Options

Search results for: RDF DATASET PROFILING