Wyniki wyszukiwania dla: TEXT CLASSIFICATION

Wyniki wyszukiwania dla: TEXT CLASSIFICATION

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 112

wyczyść wszystkie filtry niedostępne

Selection of Relevant Features for Text Classification with K-NN
Publikacja
- Rok 2013
In this paper, we describe five features selection techniques used for a text classification. An information gain, independent significance feature test, chi-squared test, odds ratio test, and frequency filtering have been compared according to the text benchmarks based on Wikipedia. For each method we present the results of classification quality obtained on the test datasets using K-NN based approach. A main advantage of evaluated...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
Publikacja
- N. Rizun
- W. Waloszek
- Rok 2018
This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...

Pełny tekst do pobrania w portalu
Text Documents Classification with Support Vector Machines
Publikacja
- P. Majewski
- Rok 2008
Comparative Analysis of Text Representation Methods Using Classification
Publikacja
- J. Szymański
- CYBERNETICS AND SYSTEMS - Rok 2014
In our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article—evaluation of approaches to text representation for machine learning tasks—indicates that the text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...

Pełny tekst do pobrania w serwisie zewnętrznym
A multi-label text message classification method designed for applications in call/contact centre systems
Publikacja
- K. Poczeta
- M. Płaza
- T. Michno
- M. Krechowicz
- M. Zawadzki
- APPLIED SOFT COMPUTING - Rok 2023
Pełny tekst do pobrania w serwisie zewnętrznym
An Analysis of Neural Word Representations for Wikipedia Articles Classification
Publikacja
- J. Szymański
- N. Kawalec
- CYBERNETICS AND SYSTEMS - Rok 2019
One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

Pełny tekst do pobrania w serwisie zewnętrznym
Two Stage SVM and kNN Text Documents Classifier
Publikacja
- M. Kępa
- J. Szymański
- Rok 2015
The paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
Improving css-KNN Classification Performance by Shifts in Training Data
Publikacja
- K. Draszawka
- J. Szymański
- F. Guerra
- Rok 2015
This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network
Publikacja
- A. Wawrzyński
- J. Szymański
- Applied Sciences-Basel - Rok 2021
To effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such as document classification, document summarization, and so forth. In our work, we study the quality of text representations using statistical methods and compare them to approaches...

Pełny tekst do pobrania w portalu
Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations
Publikacja
- N. Rizun
- W. Waloszek
- Y. Taranenko
- Information - Rok 2018
The research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...

Pełny tekst do pobrania w portalu
The Method of a Two-Level Text-Meaning Similarity Approximation of the Customers’ Opinions
Publikacja
- N. Rizun
- P. Kapłański
- Y. Taranenko
- Studia Ekonomiczne. Zeszyty Naukowe Uniwersytetu Ekonomicznego w Katowicach - Rok 2016
The method of two-level text-meaning similarity approximation, consisting in the implementation of the classification of the stages of text opinions of customers and identifying their rank quality level was developed. Proposed and proved the significance of major hypotheses, put as the basis of the developed methodology, notably about the significance of suggestions about the existence of analogies between mathematical bases of...

Pełny tekst do pobrania w portalu
Behavioral state classification in epileptic brain using intracranial electrophysiology
Publikacja
- V. Kremen
- J. J. Duque
- B. Brinkmann
- B. M. Berry
- M. T. Kucewicz
- F. Khadjevand
- J. Van Gompel
- M. Stead
- E. K. ST.Louis
- G. A. Worrell
- Journal of Neural Engineering - Rok 2017
OBJECTIVE: Automated behavioral state classification can benefit next generation implantable epilepsy devices. In this study we explored the feasibility of automated awake (AW) and slow wave sleep (SWS) classification using wide bandwidth intracranial EEG (iEEG) in patients undergoing evaluation for epilepsy surgery. APPROACH: Data from seven patients (age [Formula: see text], 4 women) who underwent intracranial depth electrode...

Pełny tekst do pobrania w serwisie zewnętrznym
A survey of automatic speech recognition deep models performance for Polish medical terms
Publikacja
- Rok 2023
Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Pełny tekst do pobrania w serwisie zewnętrznym
Contextual ontology for tonality assessment
Publikacja
- W. Waloszek
- N. Rizun
- Procedia Computer Science - Rok 2020
classification tasks. The discussion focuses on two important research hypotheses: (1) whether it is possible to construct such an ontology from a corpus of textual document, and (2) whether it is possible and beneficial to use inferencing from this ontology to support the process of sentiment classification. To support the first hypothesis we present a method of extraction of hierarchy of contexts from a set of textual documents...

Pełny tekst do pobrania w portalu
How Specific Can We Be with k-NN Classifier?
Publikacja
- K. Draszawka
- J. Szymański
- Rok 2014
This paper discusses the possibility of designing a two stage classifier for large-scale hierarchical and multilabel text classification task, that will be a compromise between two common approaches to this task. First of it is called big-bang, where there is only one classifier that aims to do all the job at once. Top-down approach is the second popular option, in which at each node of categories’ hierarchy, there is a flat classifier...

Pełny tekst do pobrania w serwisie zewnętrznym
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes
Publikacja
- H. Michalak
- K. P. Okarma
- ENTROPY - Rok 2019
Automatic text recognition from the natural images acquired in uncontrolled lighting conditions is a challenging task due to the presence of shadows hindering the shape analysis and classification of individual characters. Since the optical character recognition methods require prior image binarization, the application of classical global thresholding methods in such case makes it impossible to preserve the visibility of all...

Pełny tekst do pobrania w serwisie zewnętrznym
Clinical situations text database for Polish language
Dane Badawcze
open access
- A. Czyżewski
- D. Szplit
- J. Bogdan
- B. Graff
- K. Narkiewicz
- K. Marciniuk
- A. Harasimiuk
- P. Odya
- seria: ADMEDVOICE
Dataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
Fusion-based Representation Learning Model for Multimode User-generated Social Network Content
Publikacja
- A. M. Soomar
- ACM Journal of Data and Information Quality - Rok 2023
As mobile networks and APPs are developed, user-generated content (UGC), which includes multi-source heterogeneous data like user reviews, tags, scores, images, and videos, has become an essential basis for improving the quality of personalized services. Due to the multi-source heterogeneous nature of the data, big data fusion offers both promise and drawbacks. With the rise of mobile networks and applications, UGC, which includes...

Pełny tekst do pobrania w serwisie zewnętrznym
Text classifiers for automatic articles categorization
Publikacja
- Rok 2012
The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.
Enhancing Word Embeddings for Improved Semantic Alignment
Publikacja
- Applied Sciences-Basel - Rok 2024
This study introduces a method for the improvement of word vectors, addressing the limitations of traditional approaches like Word2Vec or GloVe through introducing into embeddings richer semantic properties. Our approach leverages supervised learning methods, with shifts in vectors in the representation space enhancing the quality of word embeddings. This ensures better alignment with semantic reference resources, such as WordNet....

Pełny tekst do pobrania w serwisie zewnętrznym
An extension to the FEEDB Multimodal Database of Facial Expressions and Emotions
Publikacja
- M. Szwoch
- L. Marco-gimenez
- M. Arevalillo-herráez
- A. Ayesh
- Rok 2015
FEEDB is a multimodal database that contains recordings of people expressing different emotions, captured by using a Microsoft Kinect sensor. Data were originally provided in the device’s proprietary format (XED), requiring both the Microsoft Kinect Studio application and a Kinect sensor attached to the system to use the files. In this paper, we present an extension of the database. For a selection of recordings, we also provide...

Pełny tekst do pobrania w serwisie zewnętrznym
Evaluation of a company’s image on social media using the Net Sentiment Rate
Publikacja
- A. Baj-Rogowska
- Rok 2020
Vast amounts of new types of data are constantly being created as a result of dynamic digitization in all areas of our lives. One of the most important and valuable categories for business is data from social networks such as Facebook. Feedback resulting from the sharing of thoughts and emotions, expressed in comments on various products and services, is becoming the key factor on which modern business is based. This feedback is...

Pełny tekst do pobrania w serwisie zewnętrznym
Selecting Features with SVM
Publikacja
- J. Rzeniewicz
- J. Szymański
- Rok 2013
A common problem with feature selection is to establish how many features should be retained at least so that important information is not lost. We describe a method for choosing this number that makes use of Support Vector Machines. The method is based on controlling an angle by which the decision hyperplane is tilt due to feature selection. Experiments were performed on three text datasets generated from a Wikipedia dump. Amount...

Pełny tekst do pobrania w serwisie zewnętrznym
A system for Direction-Of-Arrival estimation in ISM 2.4 GHz frequency band based on ESPAR antenna and SDR technology
Publikacja
- P. Kwapisiewicz
- Rok 2018
Determination of the direction of the signal arrival (DOA) finds many applications in various areas of science and industry. Knowledge of DOA is used, among others to determine the position of a satellite with a low Earth orbit (LEO), localization of people and things as well as in research of wireless communication systems, for instance the determination of the number of...

Pełny tekst do pobrania w portalu
Towards Healthcare Cloud Computing
Publikacja
- Rok 2016
In this paper we present construction of a software platform for supporting medical research teams, in the area of impedance cardiography, called IPMed. Using the platform, research tasks will be performed by the teams through computer-supported cooperative work. The platform enables secure medical data storing, access to the data for research group members, cooperative analysis of medical data and provide analysis supporting tools...

Pełny tekst do pobrania w serwisie zewnętrznym
An automated learning model for twitter sentiment analysis using Ranger AdaBelief optimizer based Bidirectional Long Short Term Memory
Publikacja
- S. Natarajan
- S. Kurian
- P. Bidare Divakarachari
- P. Falkowski-Gilski
- EXPERT SYSTEMS - Rok 2024
Sentiment analysis is an automated approach which is utilized in process of analysing textual data to describe public opinion. The sentiment analysis has major role in creating impact in the day-to-day life of individuals. However, a precise interpretation of text still relies as a major concern in classifying sentiment. So, this research introduced Bidirectional Long Short Term Memory with Ranger AdaBelief Optimizer (Bi-LSTM RAO)...

Pełny tekst do pobrania w serwisie zewnętrznym
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Dane Badawcze
open access
The SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
Framework of communication tools for an international company - internal communication
Publikacja
- A. Pegani
- Rok 2025
The following article presents a set of communication tools that a joint company can take advantage of. The publication begins with a description of the historical background of communication, followed by a classification of communication tools into internal and external categories, along with brief characteristics of each. It indicates the purpose of implementing communication tools and the impact...
Framework of communication tools for an international company - external communication
Publikacja
- A. Pegani
- Rok 2025
The following article presents a set of communication tools that a joint company can take advantage of. The publication begins with a description of the historical background of communication, followed by a classification of communication tools into internal and external categories, along with brief characteristics of each. It indicates the purpose of implementing communication tools and the impact...
External Validation Measures for Nested Clustering of Text Documents
Publikacja
- K. Draszawka
- J. Szymański
- Rok 2011
Abstract. This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure are not applicable in nested clustering cases. Additionally to the work, where F-measure was adopted to hierarchical classification as hF-measure, here some methods to...
Identification of category associations using a multilabel classifier
Publikacja
- J. Szymański
- J. Rzeniewicz
- EXPERT SYSTEMS WITH APPLICATIONS - Rok 2016
Description of the data using categories allows one to describe it on a higher abstraction level. In this way, we can operate on aggregated groups of the information, allowing one to see relationships that do not appear explicit when we analyze the individual objects separately. In this paper we present automatic identification of the associations between categories used for organization of the textual data. As experimental data...

Pełny tekst do pobrania w serwisie zewnętrznym
High Performance Control of AC Drives with Matlab/ Simulink
Publikacja
- A. Haitham
- A. Iqbal
- J. Guziński
- Rok 2021
Explore this indispensable update to a popular graduate text on electric drive techniques and the latest converters used in industry. The Second Edition of High Performance Control of AC Drives with Matlab®/Simulink delivers an updated and thorough overview of topics central to the understanding of AC motor drive systems. The book includes new material on medium voltage drives, covering state-of-the-art technologies and challenges...

Pełny tekst do pobrania w serwisie zewnętrznym
Fuel, Oils and Greases, W, E, sem.01, zimowy 22/23
Kursy Online
Division and origin of fuels. Fossil energy resources in Poland and in the world. Production and structure of fuel consumption. Main directions of crude oil processing. Classification and physical properties of gaseous and liquid fuels - natural gas, gasoline, kerosene, diesel oil, heating oil. Classification and characteristic indicators of solid fuels - hard coal, lignite, peat. Fuel contaminants and methods of their removal....
Fuel, Oils and Greases, W, E, sem.03, zimowy 22/23
Kursy Online
- P. Bzura
Division and origin of fuels. Fossil energy resources in Poland and in the world. Production and structure of fuel consumption. Main directions of crude oil processing. Classification and physical properties of gaseous and liquid fuels - natural gas, gasoline, kerosene, diesel oil, heating oil. Classification and characteristic indicators of solid fuels - hard coal, lignite, peat. Fuel contaminants and methods of their removal....
Operational Research - Queuing Systems 2022
Kursy Online
- J. Konorski
Components, characteristics, classification, analysis, and applications of queuing systems.
Orken Mamyrbayev Professor

Osoby

1. Education: Higher. In 2001, graduated from the Abay Almaty State University (now Abay Kazakh National Pedagogical University), in the specialty: Computer science and computerization manager. 2. Academic degree: Ph.D. in the specialty "6D070300-Information systems". The dissertation was defended in 2014 on the topic: "Kazakh soileulerin tanudyn kupmodaldy zhuyesin kuru". Under my supervision, 16 masters, 1 dissertation...
Operational Research - Queuing Systems 2024
Kursy Online
- J. Konorski
Course discusses components, characteristics, classification, analysis, and applications of queuing systems.
Operational Research - Queuing Systems 2024
Kursy Online
Course discusses components, characteristics, classification, analysis, and applications of queuing systems.
Operational Research 2024
Kursy Online
Course discusses components, characteristics, classification, analysis, and applications of queuing systems.
Industrial Heritage in sacrifice zones, The potential of Bocamina I & II Thermoelectric in Coronel, Chile
Publikacja
- M. A. Delso Páez
- K. Krośnicka
- Przestrzeń Ekonomia Społeczeństwo - Rok 2022
This work aims to present the recovery potential of the Chilean Sacrifice Zones, urban areas affected by high amounts of pollution caused by industrial activities. It centers in the case of “Bocamina I & II”, two Thermoelectric based in the city of Coronel, southern Chile. A settlement historically related to the mining processes. These plants operated for decades supplying the national energy...

Pełny tekst do pobrania w portalu
A study of nighttime vehicle detection algorithms
Dane Badawcze
wersja 1.0 open access
- J. Kwiatkowski
This dataset is from my master's thesis "A study of nighttime vehicle detection algorithms". It contains both raw data and preprocessed dataset ready to use. In the pictures below you can see how images were annotated.
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - All accidents
Dane Badawcze
open access
- seria: Risk classification for selected types of road accidents on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019
Data contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
Hydro and marine civil engineering
Kursy Online
- W. Sterpejkowicz-Wersocki
- A. M. Mustafa
- W. Magda
Kurs realizowany na Wydziale Inżynierii Lądowej i Środowiska (WILiŚ) Politechniki Gdańskiej. Studia: II stopnia - magisterskie, stacjonarne Kierunek: Budownictwo Semestr: 1 (letni) Presentation of basic hydro and marine civil engineering structures together with basic computational procedures for determining environmental forces acting on a structure (vertical-wall breakwater, rubble mound breakwater, submarine pipelines...
Hydro and marine civil engineering (2020-2021)
Kursy Online
- W. Sterpejkowicz-Wersocki
- A. M. Mustafa
- W. Magda
Kurs realizowany na Wydziale Inżynierii Lądowej i Środowiska (WILiŚ) Politechniki Gdańskiej. Studia: II stopnia - magisterskie, stacjonarne Kierunek: Budownictwo Semestr: 1 (letni) Presentation of basic hydro and marine civil engineering structures together with basic computational procedures for determining environmental forces acting on a structure (vertical-wall breakwater, rubble mound breakwater, submarine pipelines...
Hydro and marine civil engineering (2021-2022)
Kursy Online
- W. Sterpejkowicz-Wersocki
- A. M. Mustafa
- W. Magda
Kurs realizowany na Wydziale Inżynierii Lądowej i Środowiska (WILiŚ) Politechniki Gdańskiej. Studia: II stopnia - magisterskie, stacjonarne Kierunek: Budownictwo Semestr: 1 (letni) Presentation of basic hydro and marine civil engineering structures together with basic computational procedures for determining environmental forces acting on a structure (vertical-wall breakwater, rubble mound breakwater, submarine pipelines...
Bias mitigation benchmark that includes two datasets
Dane Badawcze
open access
ISIC-2020 is the largest skin lesion dataset divided into two classes -- benign and malignant. It contains 33126 dermoscopic images from over 2000 patients. The diagnoses were confirmed either by histopathology, expert agreement or longitudinal follow-up. The dataset was gathered by The International Skin Imaging Collaboration (ISIC) from several medical...
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Pedestrian accidents
Dane Badawcze
open access
- seria: Risk classification for selected types of road accidents on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019
Data contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: Pedestrians. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Motorcycle and moped accidents
Dane Badawcze
open access
- seria: Risk classification for selected types of road accidents on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019
Data contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: motorcyclists and mopeds. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Head-on accidents
Dane Badawcze
open access
- seria: Risk classification for selected types of road accidents on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019
Data contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, type of accidents: head-on. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Child accidents
Dane Badawcze
open access
- seria: Risk classification for selected types of road accidents on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019
Data contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: children - drivers, passengers and . vulnerable road user.. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: TEXT CLASSIFICATION

Orken Mamyrbayev Professor