Search results for: NATURAL LANGUAGE PROCESSING, LARGE LANGUAGE MODELS, DATA MINING, QUANTUM PHYSICS
-
Extracting concepts from the software requirements specification using natural language processing
PublicationExtracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....
-
Marek Czachor prof. dr hab.
People -
Evaluation of ChatGPT Applicability to Learning Quantum Physics
PublicationChatGPT is an application that uses a large language model. Its purpose is to generate answers to various questions as well as provide information, help solve problems and participate in conversations on a wide range of topics. This application is also widely used by students for the purposes of learning or cheating (e.g., writing essays or programming codes). Therefore, in this contribution, we evaluate the ability of ChatGPT...
-
Big Data i 5V – nowe wyzwania w świecie danych (Big Data and 5V – New Challenges in the World of Data)
PublicationRodzaje danych, składające się na zbiory typu Big Data, to m.in. dane generowane przez użytkowników portali internetowych, dane opisujące transakcje dokonywane poprzez Internet, dane naukowe (biologiczne, astronomiczne, pomiary fizyczne itp.), dane generowane przez roboty w wyniku automatycznego przeszukiwania przez nie Internetu (Web mining, Web crawling), dane grafowe obrazujące powiązania pomiędzy stronami WWW itd. Zazwyczaj,...
-
Language Models in Speech Recognition
PublicationThis chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.
-
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Knowledge Base Suitable for Answering Questions in Natural Language
PublicationThis paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.
-
Mohsan Ali Master of Science in Computer Science
PeopleMohsan Ali is a researcher at the University of the Aegean. He won the Marie-Curie Scholarship in 2021 in the field of open data ecosystem (ODECO) to pursue his PhD degree at the University of the Aegean. Currently, he is working on the technical interoperability of open data in the information systems laboratory; this position is funded by ODECO. His areas of expertise are open data, open data interoperability, data science, natural...
-
OrphaGPT: An Adapted Large Language Model for Orphan Diseases Classification
PublicationOrphan diseases (OD) represent a category of rare conditions that affect only a relatively small number of individuals. These conditions are often neglected in research due to the challenges posed by their scarcity, making medical advancements difficult. Then, the ever-evolving medical research and diagnosis landscape calls for more attention and innovative approaches to address the complex challenges of rare diseases and OD. Pre-trained...
-
Information Extraction from Polish Radiology Reports using Language Models
PublicationRadiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...
-
DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation
PublicationThe idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...
-
Natural language dictionaries implemented as finite automata
PublicationRozdział przedstawia wykorzystanie automatów skończonych jako słowników języka naturalnego. Podane są podstawy teoretyczne. Omówione są zastosowania: realizacja doskonałej funkcji mieszającej, analizy i syntezy morfologicznej, poprawiania pisowni i dopisywania znaków diakrytycznych, wydobywanie informacji. Podano algorytmy tworzenia automatów oraz omówiono sposoby reprezentacji automatów z uwzględnieniem kompresji.
-
Fluent Editor and Controlled Natural Language in Ontology Development
Publication -
Semantic rules representation in controlled natural language in FluentEditor
PublicationThis paper presents a way of representation of semantic rules (SWRL) in controlled English in order to facilitate understanding the rules by humans interacting with a machine. This approach (implemented in FluentEditor) may be applied in many domains, where the understandability of the rules used to support a decision process is of great importance.
-
Finite automata for compact representation of language models in NLP
PublicationPrzedstawiona zostaje technika reprezentacji modeli języka w przetwarzaniu języka naturalnego wymagająca mało pamięci. Po krótkim omówieniu przyczyn poszukiwania oszczędnej reprezentacji takich modeli języka, pokazane jest, jak automaty skończone mogą być użyte w tym celu. Technika może być postrzegana jako zastosowanie i rozszerzenie doskonałej funkcji mieszającej z wykorzystaniem automatów skończonych. Pierwsze doświadczenia...
-
DBpedia and YAGO Based System for Answering Questions in Natural Language
PublicationIn this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...
-
Visual Low-Code Language for Orchestrating Large-Scale Distributed Computing
Publication -
Evaluation of Multimedia Stream Processing Modeling Language from the Perspective of Cognitive Dimensions
PublicationW referacie zawarto opis zastosowania wymiarów poznawczych do oceny języka modelowania przetwarzania strumieni multimedialnych, nazwanego MSP-ML, w trakcie tworzenia tego języka. Poszczególne części referatu prezentują kontekst i motywacje oceny MSP-ML, metodę oceny, rezultaty oceny oraz porównanie rezultatów oceny z wynikami otrzymanymi za pomocą innych metod oceny języków modelowania wizualnego.
-
Using LSTM networks to predict engine condition on large scale data processing framework
PublicationAs the Internet of Things technology is developing rapidly, companies have an ability to observe the health of engine components and constructed systems through collecting signals from sensors. According to output of IoT sensors, companies can build systems to predict the conditions of components. Practically the components are required to be maintained or replaced before the end of life in performing their assigned task. Predicting...
-
Krzysztof Goczyła prof. dr hab. inż.
PeopleKrzysztof Goczyła, full professor of Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics Technical University of Gdansk in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
-
Natural Language Semantics
Journals -
Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition
Publication -
Path integrals formulations leading to propagator evaluation for coupled linear physics in large geometric models
PublicationReformulating linear physics using second kind Fredholm equations is very standard practice. One of the straightforward consequences is that the resulting integrals can be expanded (when the Neumann expansion converges) and probabilized, leading to path statistics and Monte Carlo estimations. An essential feature of these algorithms is that they also allow to estimate propagators for all types of sources, including initial conditions....
-
Sylwester Kaczmarek dr hab. inż.
PeopleSylwester Kaczmarek received his M.Sc in electronics engineering, Ph.D. and D.Sc. in switching and teletraffic science from the Gdansk University of Technology, Gdansk, Poland, in 1972, 1981 and 1994, respectively. His research interests include: IP QoS and GMPLS and SDN networks, switching, QoS routing, teletraffic, multimedia services and quality of services. Currently, his research is focused on developing and applicability...
-
Natural Language & Linguistic Theory
Journals -
Data Acquisition and Processing for GeoAI Models to Support Sustainable Agricultural Practices
PublicationThere are growing opportunities to leverage new technologies and data sources to address global problems related to sustainability, climate change, and biodiversity loss. The emerging discipline of GeoAI resulting from the convergence of AI and Geospatial science (Geo-AI) is enabling the possibility to harness the increasingly available open Earth Observation data collected from different constellations of satellites and sensors...
-
Using LSTM networks to predict engine condition on large scale data processing framework
Publication -
IEEE Transactions on Audio Speech and Language Processing
Journals -
Leszek Ziemczonek dr
PeopleUniversity education 1973-1978 – Nicolaus Copernicus University in Toruń, University of Gdańsk in Gdańsk, Mathematical Physics, M. Sc. 1979 – Diploma of Postgraduate Studies, Pedagogics 1989 – Institute of Physics, Polish Academy of Sciences in Warsaw, Theoretical Physics, Ph. D. 2010-2012 – Diploma of Postgraduate Studies, Mathematics Training: · 09.1983 – Trieste (Italy) – International Centre for Theoretical Physics...
-
IEEE-ACM Transactions on Audio Speech and Language Processing
Journals -
Intelligent information services 23/24
e-Learning CoursesInformation retrieval Text categorization Natural language processing
-
Empirical Methods in Natural Language Processing
Conferences -
Natural Language Processing and Knowledge Engineering
Conferences -
Applications of Natural Language to Data Bases
Conferences -
Ontology-Aided Software Engineering
PublicationThis thesis is located between the fields of research on Artificial Intelligence (AI), Knowledge Representation and Reasoning (KRR), Computer-Aided Software Engineering (CASE) and Model Driven Engineering (MDE). The modern offspring of KRR - Description Logic (DL) [Baad03] is considered here as a formalization of the software engineering Methods & Tools. The bridge between the world of formal specification (governed by the mathematics)...
-
Paweł Możejko dr hab.
People -
ACM Transactions on Asian and Low-Resource Language Information Processing
Journals -
International Joint Conference on Natural Language Processing
Conferences -
Natural Language Engineering
Journals -
Joint Conference on New Methods in Language Processing and Computational Natural Language Learning
Conferences -
Conference on Computational Natural Language Learning (Conference on Natural Language Learning)
Conferences -
International Conference on Recent Advances in Natural Language Processing
Conferences -
Computer controlled systems - 2022/2023
e-Learning Coursesmateriały wspierające wykład na studiach II stopnia na kierunku ACR pod tytułem komputerowe systemy automatyki 1. Computer system – controlled plant interfacing technique; simple interfacing and with both side acknowledgement; ideas, algorithms, acknowledge passing. 2. Methods of acknowledgement passing: software checking and passing, using interrupt techniques, using readiness checking (ready – wait lines). The best solution...
-
CCS-lecture-2023-2024
e-Learning Coursesmateriały wspierające wykład na studiach II stopnia na kierunku ACR pod tytułem komputerowe systemy automatyki 1. Computer system – controlled plant interfacing technique; simple interfacing and with both side acknowledgement; ideas, algorithms, acknowledge passing. 2. Methods of acknowledgement passing: software checking and passing, using interrupt techniques, using readiness checking (ready – wait lines). The best solution optimization...
-
Towards facts extraction from text in Polish language
PublicationNatural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...
-
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
PublicationThe concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis
-
Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain
PublicationSemantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...
-
Language, Data and Knowledge
Conferences -
Piotr Krajewski dr
PeoplePiotr Krajewski is a librarian at the Library of Gdańsk University of Technology (GUT) and a PhD student at the Medical University of Gdańsk. His research interests focus on the standardization of the e-resources usage data and Open Access publishing, especially the role of institutional repositories in the development of the OA initiative and the phenomenon of “predatory publishers”. He works at Scientific and Technical Information...
-
Logic and Engineering of Natural Language Semantics
Conferences