Wyniki wyszukiwania dla: large language models - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: large language models

Filtry

wszystkich: 5674
wybranych: 3928

wyczyść wszystkie filtry


Filtry wybranego katalogu

  • Kategoria

  • Rok

  • Opcje

wyczyść Filtry wybranego katalogu niedostępne

Wyniki wyszukiwania dla: large language models

  • Language Models in Speech Recognition

    Publikacja

    - Rok 2022

    This chapter describes language models used in speech recognition, It starts by indicating the role and the place of language models in speech recognition. Mesures used to compare language models follow. An overview of n-gram, syntactic, semantic, and neural models is given. It is accompanied by a list of popular software.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • OrphaGPT: An Adapted Large Language Model for Orphan Diseases Classification

    Publikacja

    - Rok 2024

    Orphan diseases (OD) represent a category of rare conditions that affect only a relatively small number of individuals. These conditions are often neglected in research due to the challenges posed by their scarcity, making medical advancements difficult. Then, the ever-evolving medical research and diagnosis landscape calls for more attention and innovative approaches to address the complex challenges of rare diseases and OD. Pre-trained...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Finite automata for compact representation of language models in NLP

    Publikacja

    Przedstawiona zostaje technika reprezentacji modeli języka w przetwarzaniu języka naturalnego wymagająca mało pamięci. Po krótkim omówieniu przyczyn poszukiwania oszczędnej reprezentacji takich modeli języka, pokazane jest, jak automaty skończone mogą być użyte w tym celu. Technika może być postrzegana jako zastosowanie i rozszerzenie doskonałej funkcji mieszającej z wykorzystaniem automatów skończonych. Pierwsze doświadczenia...

  • Visual Low-Code Language for Orchestrating Large-Scale Distributed Computing

    Publikacja
    • K. Rybiński
    • M. Śmiałek
    • A. Sostaks
    • K. Marek
    • R. Roszczyk
    • M. Wdowiak

    - Journal of Grid Computing - Rok 2023

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Information Extraction from Polish Radiology Reports using Language Models

    Publikacja

    Radiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...

    Pełny tekst do pobrania w portalu

  • Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition

    Publikacja
    • S. Dziadzio
    • A. Nabożny
    • A. Smywiński-Pohl
    • B. Ziółko

    - Rok 2015

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Design and experimental evaluation of pod propulsor models for a large self-propelled ship model.

    Publikacja

    Artykuł przedstawia serię hydrodynamicznych badań swobodnych dwóch modeli pędnika podowego. Pędniki zostały zaprojektowane i zbudowane specjalnie dla dwóch wersji dużego modelu okrętu z własnym napędem, przeznaczonego do eksperymentów manewrowych. Jedna wersja jest napędzana pojedyńczym pędnikiem, druga jest wyposażona w dwa pędniki. Oba modele podów były badane w kanale obiegowym. Celem eksperymentu były pomiary sześciu składowych...

  • Path integrals formulations leading to propagator evaluation for coupled linear physics in large geometric models

    Publikacja

    - COMPUTER PHYSICS COMMUNICATIONS - Rok 2024

    Reformulating linear physics using second kind Fredholm equations is very standard practice. One of the straightforward consequences is that the resulting integrals can be expanded (when the Neumann expansion converges) and probabilized, leading to path statistics and Monte Carlo estimations. An essential feature of these algorithms is that they also allow to estimate propagators for all types of sources, including initial conditions....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Performance of various risk prediction models in a large lung cancer screening cohort in Gdańsk, Poland—a comparative study

    Publikacja
    • M. Ostrowski
    • F. Bińczyk
    • T. Marjański
    • R. Dziedzic
    • S. Pisiak
    • S. Małgorzewicz
    • M. Adamek
    • J. Polańska
    • W. Rzyman

    - Translational Lung Cancer Research - Rok 2021

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Quantifying inconsistencies in the Hamburg Sign Language Notation System

    Publikacja
    • M. Ferlin
    • S. Majchrowska
    • M. A. Plantykow
    • A. Kwaśniewska
    • A. Mikołajczyk-Bareła
    • M. Olech
    • J. Nalepa

    - EXPERT SYSTEMS WITH APPLICATIONS - Rok 2024

    The advent of machine learning (ML) has significantly advanced the recognition and translation of sign languages, bridging communication gaps for hearing-impaired communities. At the heart of these technologies is data labeling, crucial for training ML algorithms on a huge amount of consistently labeled data to achieve models that generalize well. The adoption of language-agnostic annotations is essential to connect different sign...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Beyond Traditional Learning: The LLM Revolution in BPM Education at University

    Publikacja

    - Rok 2024

    Large Language Models (LLMs) significantly impact higher education, requiring changes in educational processes, especially in Business Process Management (BPM) practical exercises. The research aims to evaluate the effectiveness of LLMs in BPM education to determine if LLMs can supplement educators. The study involved 33 master’s degree students. Students’ works were manually evaluated and compared to LLM-generated responses. Results...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • News that Moves the Market: DSEX-News Dataset for Forecasting DSE Using BERT

    Publikacja

    - Rok 2024

    Stock market is a complex and dynamic industry that has always presented challenges for stakeholders and investors due to its unpredictable nature. This unpredictability motivates the need for more accurate prediction models. Traditional prediction models have limitations in handling the dynamic nature of the stock market. Additionally, previous methods have used less relevant data, leading to suboptimal performance. This study...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Sign Language Recognition Using Convolution Neural Networks

    Publikacja

    The objective of this work was to provide an app that can automatically recognize hand gestures from the American Sign Language (ASL) on mobile devices. The app employs a model based on Convolutional Neural Network (CNN) for gesture classification. Various CNN architectures and optimization strategies suitable for devices with limited resources were examined. InceptionV3 and VGG-19 models exhibited negligibly higher accuracy than...

    Pełny tekst do pobrania w portalu

  • Modelling and simulation of GPU processing in the MERPSYS environment

    In this work, we evaluate an analytical GPU performance model based on Little's law, that expresses the kernel execution time in terms of latency bound, throughput bound, and achieved occupancy. We then combine it with the results of several research papers, introduce equations for data transfer time estimation, and finally incorporate it into the MERPSYS framework, which is a general-purpose simulator for parallel and distributed...

    Pełny tekst do pobrania w portalu

  • DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING

    The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

    Pełny tekst do pobrania w portalu

  • An Analysis of Neural Word Representations for Wikipedia Articles Classification

    Publikacja

    - CYBERNETICS AND SYSTEMS - Rok 2019

    One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Towards facts extraction from text in Polish language

    Publikacja

    Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...

    Pełny tekst do pobrania w portalu

  • Ontology of the Design Pattern Language for Smart Cities Systems

    Publikacja

    The paper presents the definition of the design pattern language of Smart Cities in the form of an ontology. Since the implementation of a Smart City system is difficult, expensive and closely linked with the problems concerning a given city, the knowledge acquired during a single implementation is extremely valuable. The language we defined supports the management of such knowledge as it allows for the expression of a solution...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Semantic modeling of contextual augmented reality environments

    Publikacja
    • D. Rumiński

    - Rok 2018

    Despite significant progress in the field of augmented reality (AR), regarding both hardware and software, there is still a lack of universal models and methods that would enable building ubiquitous AR systems that could be used anywhere and anytime, covering different application areas. This dissertation describes a new approach to building AR systems, called the Contextual Augmented Reality Environment (CARE). The CARE approach...

    Pełny tekst do pobrania w portalu

  • The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

    Publikacja

    - Rok 2024

    The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

    Pełny tekst do pobrania w portalu

  • English Language Learning Employing Developments in Multimedia IS

    Publikacja

    In the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • A survey of automatic speech recognition deep models performance for Polish medical terms

    Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • An Approach to Trust Case Development

    Publikacja

    In the paper we present an approach to the architectural trust case development for DRIVE, the IT infrastructure supporting the processes of drugs distribution and application. The objectives of DRIVE included safer and cheaper drugs distribution and application. A trust case represents an argument supporting the trustworthiness of the system. It is decomposed into claims that postulate some trust related properties. Claims differ...

    Pełny tekst do pobrania w portalu

  • Towards Facts Extraction From Texts in Polish Language

    The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

    Pełny tekst do pobrania w portalu

  • Knowledge Base Suitable for Answering Questions in Natural Language

    This paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Automatic Classification of Polish Sign Language Words

    In the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.

    Pełny tekst do pobrania w portalu

  • MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

    In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...

    Pełny tekst do pobrania w portalu

  • Extracting concepts from the software requirements specification using natural language processing

    Publikacja

    - Rok 2018

    Extracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

    Publikacja

    - Rok 2013

    The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

  • A Model-Driven Solution for Development of Multimedia Stream Processing Applications

    Publikacja

    This paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Creating Polish space language dictionary - lessons learned

    Publikacja

    Polish space industry suffers from lack of space vocabulary. Since joining European Space Agency in 2012, the sector has expanded rapidly now employing over 1000 specialists focusing mainly on space sustainability, space debris detection and tracking, robotics and propulsion systems. The Polish Space Agency together with The Polish Committee for Standardization have committed to creating the first lexicon of space language, along...

    Pełny tekst do pobrania w portalu

  • Web Services Choreography Description Language - WSCDL.

    Publikacja

    - Rok 2004

    Język Web Services Choreography Description Language służy do opisu współpracy równy z równym. Został zaprojektowany z myślą o automatyzacji współpracy usług sieciowych, ale jest na tyle ogólny, że pozwala opisywać współpracę nie tylko w świecie komputerowym. Prezentowana jest geneza tego języka oraz jego model. Następnie opisana jest struktura języka poparta przykładem dokumentu napisanego w języku WSCDL.

  • Language of Benefits as a Novel Tool for Improving Website Personalization

    A properly designed website allows the user to search for information faster, and more accurately. The information content of the website should be also adapted to the needs of the user. The purpose of this article is to present a novel Language of Benefits (LoB) approach to facilitate the use of websites for individual user groups. The LoB approach is an approach addressed to IT Analysts, to facilitate the process of web design,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Terminological and Assertional Queries in KQL Knowledge Access Language

    One of the directions of development of information systems in recent years in the evolution of data-based systems into the knowledge-based systems. As a part of this process there is ongoing work on a whole range of languages for accessing knowledge bases. They can be used in a variety of applications, however their main drawback is the lack of clearly defined algebra representing a theoretical basis for them. For instance, such...

  • Analysis of economical lighting of highways in the environment of SMOL language

    The paper puts forward and implements a method of designing and creating a modelling simulation environment for eztensive and complete analysis of economical lighting on highways. From a general design viewpoint, the proposed solution explores the concept of a network description language (SMOL), which has been designed to describe the necessary network functions, mechanisms, and devices; for the purpose of their computer simulation...

    Pełny tekst do pobrania w portalu

  • The Use of the Language of Mathematics as an Inspiration for Contemporary Architectural Design

    Publikacja

    The purpose of the article is to present the evolution of the use of mathematical language as an inspiration for creating spatial, three-dimensional forms in art and architecture. The article focuses on the possibilities for art and architectural design ideas gained by contemporary mathematics, algorithms and computational parametric approach. The analysis of various examples represents the relationships between the composition...

    Pełny tekst do pobrania w portalu

  • Words context analysis for improvement of information retrieval

    Publikacja

    - Rok 2012

    In the article we present an approach to improvement of retrieval informationfrom large text collections using words context vectors. The vectorshave been created analyzing English Wikipedia with Hyperspace Analogue to Language model of words similarity. For test phrases we evaluate retrieval with direct user queries as well as retrieval with context vectors of these queries. The results indicate that the proposed method can not...

  • Geometric Algebra Model of Distributed Representations

    Publikacja

    - Rok 2010

    Formalism based on GA is an alternative to distributed representation models developed so far-Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...

  • DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation

    The idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...

    Pełny tekst do pobrania w portalu

  • Modelling of the High Speed Multi-Pole Synchronous Generator for Application in More Electric Aircraft Power Systems

    Publikacja

    In this paper different models of the synchronous generator are presented. The simulation results compared with the measurements are shown. Certain physical phenomena are included in described models for the porpoise of adequate analysis of the more electric aircraft power system. For different modelling levels, such as functional level or behavioural level, different physical phenomena have been included. Simulation results for...

  • Description Logic As A Common Software Engineering Artifacts Language

    Publikacja

    - Rok 2008

    Description logic is proposed as a powerful language able to support chosen software engineering process tasks like: requirements engineering, software architecture definition, software design and configuration management. To do this there is presented a correspondence between description logic and UML. Description logic based integrated software engineering process framework is proposed which owing to automatic knowledge inferring...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Badania empiryczne związane z ewolucją języków - wybrane zagadnienia

    Although language evolution is an area in science yet to be developed, its foundations lay on empirical research. The aim of this article is to present three categories of ways to get empirical data on language evolution: observing language in laboratory, monitoring animal communication and analysing pidgins and creoles. The part of the paper about language in laboratory bases on English-language articles presenting the experiments...

  • Individual Resources and Intercultural Interactions

    Publikacja

    - Rok 2017

    The work environment in multinational corporations (MNCs) is specific and demanding including intercultural interactions with co-workers and clients and using a foreign language. Some individual resources can help in dealing with these circumstances. Individual resources refer to personal dispositions, competencies and prior experiences. With regard to previous studies, a caravan of personal resources, namely Psychological Capital...

  • Viability of decisional DNA in robotics

    Publikacja

    - Procedia Computer Science - Rok 2014

    The Decisional DNA is an artificial intelligence system that uses prior experiences to shape future decisions. Decisional DNA is written in the Set Of Experience Knowledge Structure (SOEKS) and is capable of capturing and reusing a broad range of data. Decisional DNA has been implemented in several fields including Alzheimer’s diagnosis, geothermal energy and smart TV. Decisional DNA is well suited to use in robotics due to the...

    Pełny tekst do pobrania w portalu

  • Previous Opinions is All You Need - Legal Information Retrieval System

    Publikacja

    - Rok 2023

    We present a system for retrieving the most relevant legal opinions to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As a training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Ontology-Aided Software Engineering

    Publikacja

    - Rok 2012

    This thesis is located between the fields of research on Artificial Intelligence (AI), Knowledge Representation and Reasoning (KRR), Computer-Aided Software Engineering (CASE) and Model Driven Engineering (MDE). The modern offspring of KRR - Description Logic (DL) [Baad03] is considered here as a formalization of the software engineering Methods & Tools. The bridge between the world of formal specification (governed by the mathematics)...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

    Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

    Pełny tekst do pobrania w portalu

  • Using FreeFEM open software for modelling the vibrations of piezoelectric devices

    Modelling vibrations of piezoelectric transducers has been a topic discussed in the literature for many decades. The first models - so-called one-dimensional - describe the vibrations only near operating frequency and near its harmonics. Attempts to introduce two-dimensional models were related to the possibility of one transducer working at several frequencies, including both thickness vibrations and those resulting from the transducer...

    Pełny tekst do pobrania w portalu

  • Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain

    Publikacja

    - Rok 2015

    Semantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM

    The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

    Pełny tekst do pobrania w portalu