Search results for: large language models - Bridge of Knowledge

Search

Search results for: large language models

Search results for: large language models

  • Language Models in Speech Recognition

    Publication

    - Year 2022

    This chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.

    Full text to download in external service

  • OrphaGPT: An Adapted Large Language Model for Orphan Diseases Classification

    Publication

    - Year 2024

    Orphan diseases (OD) represent a category of rare conditions that affect only a relatively small number of individuals. These conditions are often neglected in research due to the challenges posed by their scarcity, making medical advancements difficult. Then, the ever-evolving medical research and diagnosis landscape calls for more attention and innovative approaches to address the complex challenges of rare diseases and OD. Pre-trained...

    Full text to download in external service

  • Finite automata for compact representation of language models in NLP

    Publication

    Przedstawiona zostaje technika reprezentacji modeli języka w przetwarzaniu języka naturalnego wymagająca mało pamięci. Po krótkim omówieniu przyczyn poszukiwania oszczędnej reprezentacji takich modeli języka, pokazane jest, jak automaty skończone mogą być użyte w tym celu. Technika może być postrzegana jako zastosowanie i rozszerzenie doskonałej funkcji mieszającej z wykorzystaniem automatów skończonych. Pierwsze doświadczenia...

  • Visual Low-Code Language for Orchestrating Large-Scale Distributed Computing

    Publication
    • K. Rybiński
    • M. Śmiałek
    • A. Sostaks
    • K. Marek
    • R. Roszczyk
    • M. Wdowiak

    - Journal of Grid Computing - Year 2023

    Full text to download in external service

  • Information Extraction from Polish Radiology Reports using Language Models

    Publication

    Radiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...

    Full text available to download

  • Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition

    Publication
    • S. Dziadzio
    • A. Nabożny
    • A. Smywiński-Pohl
    • B. Ziółko

    - Year 2015

    Full text to download in external service

  • Design and experimental evaluation of pod propulsor models for a large self-propelled ship model.

    Publication

    Artykuł przedstawia serię hydrodynamicznych badań swobodnych dwóch modeli pędnika podowego. Pędniki zostały zaprojektowane i zbudowane specjalnie dla dwóch wersji dużego modelu okrętu z własnym napędem, przeznaczonego do eksperymentów manewrowych. Jedna wersja jest napędzana pojedyńczym pędnikiem, druga jest wyposażona w dwa pędniki. Oba modele podów były badane w kanale obiegowym. Celem eksperymentu były pomiary sześciu składowych...

  • Path integrals formulations leading to propagator evaluation for coupled linear physics in large geometric models

    Publication

    - COMPUTER PHYSICS COMMUNICATIONS - Year 2024

    Reformulating linear physics using second kind Fredholm equations is very standard practice. One of the straightforward consequences is that the resulting integrals can be expanded (when the Neumann expansion converges) and probabilized, leading to path statistics and Monte Carlo estimations. An essential feature of these algorithms is that they also allow to estimate propagators for all types of sources, including initial conditions....

    Full text to download in external service

  • Performance of various risk prediction models in a large lung cancer screening cohort in Gdańsk, Poland—a comparative study

    Publication
    • M. Ostrowski
    • F. Bińczyk
    • T. Marjański
    • R. Dziedzic
    • S. Pisiak
    • S. Małgorzewicz
    • M. Adamek
    • J. Polańska
    • W. Rzyman

    - Translational Lung Cancer Research - Year 2021

    Full text to download in external service

  • WikiPrefs: human preferences dataset build from text edits

    Open Research Data

    The WikiPrefs dataset is a human preferences dataset for Large Language Models alignment. It was built using the EditPrefs method from historical edits of Wikipedia featured articles

  • The American Sign Language alphabet

    Open Research Data
    open access

    The American Sign Language dataset contains all static letters of the American alphabet, meaning those that do not require movement to perform (the entire alphabet except for the letters 'J' and 'Z', which are dynamic and require hand movement).

  • Robert Piotrowski dr hab. inż.

    Robert Piotrowski jest absolwentem Wydziału Elektrotechniki i Automatyki (2001r., kierunek: Automatyka i Robotyka) oraz Wydziału Zarządzania i Ekonomii (2002r., kierunek: Organizacja Systemów Produkcyjnych) Politechniki Gdańskiej. Od 2005 roku jest zatrudniony na Wydziale Elektrotechniki i Automatyki, aktualnie w Katedrze Inteligentnych Systemów Sterowania i Wspomagania Decyzji. W 2005 roku obronił rozprawę doktorską (Automatyka...

  • Quantifying inconsistencies in the Hamburg Sign Language Notation System

    Publication
    • M. Ferlin
    • S. Majchrowska
    • M. A. Plantykow
    • A. Kwaśniewska
    • A. Mikołajczyk-Bareła
    • M. Olech
    • J. Nalepa

    - EXPERT SYSTEMS WITH APPLICATIONS - Year 2024

    The advent of machine learning (ML) has significantly advanced the recognition and translation of sign languages, bridging communication gaps for hearing-impaired communities. At the heart of these technologies is data labeling, crucial for training ML algorithms on a huge amount of consistently labeled data to achieve models that generalize well. The adoption of language-agnostic annotations is essential to connect different sign...

    Full text to download in external service

  • Beyond Traditional Learning: The LLM Revolution in BPM Education at University

    Publication

    - Year 2024

    Large Language Models (LLMs) significantly impact higher education, requiring changes in educational processes, especially in Business Process Management (BPM) practical exercises. The research aims to evaluate the effectiveness of LLMs in BPM education to determine if LLMs can supplement educators. The study involved 33 master’s degree students. Students’ works were manually evaluated and compared to LLM-generated responses. Results...

    Full text to download in external service

  • News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT

    Publication

    - Year 2024

    Stock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...

    Full text to download in external service

  • Sign Language Recognition Using Convolution Neural Networks

    Publication

    The objective of this work was to provide an app that can automatically recognize hand gestures from the American Sign Language (ASL) on mobile devices. The app employs a model based on Convolutional Neural Network (CNN) for gesture classification. Various CNN architectures and optimization strategies suitable for devices with limited resources were examined. InceptionV3 and VGG-19 models exhibited negligibly higher accuracy than...

    Full text available to download

  • Modelling and simulation of GPU processing in the MERPSYS environment

    In this work, we evaluate an analytical GPU performance model based on Little's law, that expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...

    Full text available to download

  • DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING

    The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

    Full text available to download

  • An Analysis of Neural Word Representations for Wikipedia Articles Classification

    Publication

    - CYBERNETICS AND SYSTEMS - Year 2019

    One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

    Full text to download in external service

  • Towards facts extraction from text in Polish language

    Publication

    Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...

    Full text available to download

  • Ontology of the Design Pattern Language for Smart Cities Systems

    Publication

    The paper presents the definition of the design pattern language of Smart Cities in the form of an ontology. Since the implementation of a Smart City system is difficult, expensive and closely linked with the problems concerning a given city, the knowledge acquired during a single implementation is extremely valuable. The language we defined supports the management of such knowledge as it allows for the expression of a solution...

    Full text to download in external service

  • Semantic modeling of contextual augmented reality environments

    Publication
    • D. Rumiński

    - Year 2018

    Despite significant progress in the field of augmented reality (AR), regarding both hardware and software, there is still a lack of universal models and methods that would enable building ubiquitous AR systems that could be used anywhere and anytime, covering different application areas. This dissertation describes a new approach to building AR systems, called the Contextual Augmented Reality Environment (CARE). The CARE approach...

    Full text available to download

  • The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

    Publication

    - Year 2024

    The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

    Full text available to download

  • English Language Learning Employing Developments in Multimedia IS

    Publication

    In the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...

    Full text to download in external service

  • A survey of automatic speech recognition deep models performance for Polish medical terms

    Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

    Full text to download in external service

  • An Approach to Trust Case Development

    Publication

    - Year 2003

    In the paper we present an approach to the architectural trust case development for DRIVE, the IT infrastructure supporting the processes of drugs distribution and application. The objectives of DRIVE included safer and cheaper drugs distribution and application. A trust case represents an argument supporting the trustworthiness of the system. It is decomposed into claims that postulate some trust related properties. Claims differ...

    Full text available to download

  • Towards Facts Extraction From Texts in Polish Language

    The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

    Full text available to download

  • Knowledge Base Suitable for Answering Questions in Natural Language

    Publication

    This paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.

    Full text to download in external service

  • PPAM 2022

    Events

    11-09-2022 07:00 - 14-09-2022 13:56

    The PPAM 2022 conference, will cover topics in parallel and distributed computing, including theory and applications, as well as applied mathematics.

  • Automatic Classification of Polish Sign Language Words

    In the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.

    Full text available to download

  • MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

    In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...

    Full text available to download

  • Extracting concepts from the software requirements specification using natural language processing

    Publication

    - Year 2018

    Extracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....

    Full text to download in external service

  • Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

    Publication

    - Year 2013

    The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

  • A Model-Driven Solution for Development of Multimedia Stream Processing Applications

    Publication

    This paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...

    Full text to download in external service

  • Creating Polish space language dictionary - lessons learned

    Publication

    Polish space industry suffers from lack of space vocabulary. Since joining European Space Agency in 2012, the sector has expanded rapidly now employing over 1000 specialists focusing mainly on space sustainability, space debris detection and tracking, robotics and propulsion systems. The Polish Space Agency together with The Polish Committee for Standardization have committed to creating the first lexicon of space language, along...

    Full text available to download

  • Web Services Choreography Description Language - WSCDL.

    Publication

    - Year 2004

    Język Web Services Choreography Description Language służy do opisu współpracy równy z równym. Został zaprojektowany z myślą o automatyzacji współpracy usług sieciowych, ale jest na tyle ogólny, że pozwala opisywać współpracę nie tylko w świecie komputerowym. Prezentowana jest geneza tego języka oraz jego model. Następnie opisana jest struktura języka poparta przykładem dokumentu napisanego w języku WSCDL.

  • Language of Benefits as a Novel Tool for Improving Website Personalization

    A properly designed website allows the user to search for information faster, and more accurately. The information content of the website should be also adapted to the needs of the user. The purpose of this article is to present a novel Language of Benefits (LoB) approach to facilitate the use of websites for individual user groups. The LoB approach is an approach addressed to IT Analysts, to facilitate the process of web design,...

    Full text to download in external service

  • Terminological and Assertional Queries in KQL Knowledge Access Language

    One of the directions of development of information systems in recent years in the evolution of data-based systems into the knowledge-based systems. As a part of this process there is ongoing work on a whole range of languages for accessing knowledge bases. They can be used in a variety of applications, however their main drawback is the lack of clearly defined algebra representing a theoretical basis for them. For instance, such...

  • Analysis of economical lighting of highways in the environment of SMOL language

    The paper puts forward and implements a method of designing and creating a modelling simulation environment for eztensive and complete analysis of economical lighting on highways. From a general design viewpoint, the proposed solution explores the concept of a network description language (SMOL), which has been designed to describe the necessary network functions, mechanisms, and devices; for the purpose of their computer simulation...

    Full text available to download

  • The Use of the Language of Mathematics as an Inspiration for Contemporary Architectural Design

    Publication

    The purpose of the article is to present the evolution of the use of mathematical language as an inspiration for creating spatial, three-dimensional forms in art and architecture. The article focuses on the possibilities for art and architectural design ideas gained by contemporary mathematics, algorithms and computational parametric approach. The analysis of various examples represents the relationships between the composition...

    Full text available to download

  • Words context analysis for improvement of information retrieval

    Publication

    - Year 2012

    In the article we present an approach to improvement of retrieval informationfrom large text collections using words context vectors. The vectorshave been created analyzing English Wikipedia with Hyperspace Analogue to Language model of words similarity. For test phrases we evaluate retrieval with direct user queries as well as retrieval with context vectors of these queries. The results indicate that the proposed method can not...

  • Geometric Algebra Model of Distributed Representations

    Publication

    - Year 2010

    Formalism based on GA is an alternative to distributed representation models developed so far-Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...

  • DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation

    The idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...

    Full text available to download

  • Deep neural networks for data analysis 24/25

    e-Learning Courses
    • J. Cychnerski
    • K. Draszawka

    This course covers introduction to supervised machine learning, construction of basic artificial deep neural networks (DNNs) and basic training algorithms, as well as the overview of popular DNNs architectures (convolutional networks, recurrent networks, transformers). The course introduces students to popular regularization techniques for deep models. Besides theory, large part of the course is the project in which students apply...

  • Modelling of the High Speed Multi-Pole Synchronous Generator for Application in More Electric Aircraft Power Systems

    Publication

    In this paper different models of the synchronous generator are presented. The simulation results compared with the measurements are shown. Certain physical phenomena are included in described models for the porpoise of adequate analysis of the more electric aircraft power system. For different modelling levels, such as functional level or behavioural level, different physical phenomena have been included. Simulation results for...

  • Description Logic As A Common Software Engineering Artifacts Language

    Publication

    - Year 2008

    Description logic is proposed as a powerful language able to support chosen software engineering process tasks like: requirements engineering, software architecture definition, software design and configuration management. To do this there is presented a correspondence between description logic and UML. Description logic based integrated software engineering process framework is proposed which owing to automatic knowledge inferring...

    Full text to download in external service

  • Badania empiryczne związane z ewolucją języków - wybrane zagadnienia

    Although language evolution is an area in science yet to be developed, its foundations lay on empirical research. The aim of this article is to present three categories of ways to get empirical data on language evolution: observing language in laboratory, monitoring animal communication and analysing pidgins and creoles. The part of the paper about language in laboratory bases on English-language articles presenting the experiments...

  • Individual Resources and Intercultural Interactions

    Publication

    - Year 2017

    The work environment in multinational corporations (MNCs) is specific and demanding including intercultural interactions with co-workers and clients and using a foreign language. Some individual resources can help in dealing with these circumstances. Individual resources refer to personal dispositions, competencies and prior experiences. With regard to previous studies, a caravan of personal resources, namely Psychological Capital...

  • Viability of decisional DNA in robotics

    Publication

    - Procedia Computer Science - Year 2014

    The Decisional DNA is an artificial intelligence system that uses prior experiences to shape future decisions. Decisional DNA is written in the Set Of Experience Knowledge Structure (SOEKS) and is capable of capturing and reusing a broad range of data. Decisional DNA has been implemented in several fields including Alzheimer’s diagnosis, geothermal energy and smart TV. Decisional DNA is well suited to use in robotics due to the...

    Full text available to download

  • Previous Opinions is All You Need - Legal Information Retrieval System

    Publication

    - Year 2023

    We present a system for retrieving the most relevant legal opinions to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As a training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...

    Full text to download in external service