Filtry
wszystkich: 250
wybranych: 28
-
Katalog
Filtry wybranego katalogu
Wyniki wyszukiwania dla: corpus studies
-
Multimodal English corpus for automatic speech recognition
PublikacjaA multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
-
An audio-visual corpus for multimodal automatic speech recognition
Publikacjareview of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
COLOUR TERMS IN INORGANIC CHEMISTRY: A CORPUS STUDY
Publikacja -
THE ADJECTIVES LIGHT AND DARK IN ASTROPHYSICAL TEXTS: A CORPUS STUDY
Publikacja -
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
PublikacjaThese days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...
-
The Presidential Campaign of Małgorzata Kidawa-Błońska in Media Discourse: Analysis Based on Statistical Corpus Analysis and Topic Modelling
Publikacja -
Once in a season – the pragmatic function of fuck in “BoJack Horseman” TV Show
PublikacjaThis article investigates the use and pragmatic functions of the swear word fuck in the “BoJack Horseman” produced by Netflix and bridges the gap in the linguistic research on this particular TVshow. Incorporating corpus linguistics tools, the BoJack Horseman Corpus was compiled and thelemma fuck has been investigated and analysed from the multimodal perspective....
-
Electrochemical Evaluation of Sustainable Corrosion Inhibitors via Dynamic Electrochemical Impedance Spectroscopy
PublikacjaFinding suitable measurement methods for the effective management of electrochemical problems is of paramount importance, particularly for improving efficiency in corrosion protection. The need for accurate measurement techniques specific to nonstationary conditions has long been recognized, and promising approaches have emerged. This chapter introduces dynamic electrochemical impedance spectroscopy as a novel advancement in electrochemistry...
-
Enriching the Context: Methods of Improving the Non-contextual Assessment of Sentence Credibility
PublikacjaThis paper presents several methods of automatic context enrichment of sentences that need to be evaluated, tagged or fact-checked by human judges. We have created a corpus of medical Web articles. Sentences from this corpus have been fact-checked by medical experts in two modes: contextually (reading the entire article and evaluating sentence by sentence) and without context (evaluating sentences from all articles in random order)....
-
Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora
PublikacjaMethodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...
-
Phraseological Units in Audiovisual Translation. A Case Study of Polish Dubbing of Disney’s 'The Little Mermaid'
PublikacjaThe paper aims to discuss phraseological units as the object of audiovisual translation in the Polish dubbing of Disney’s 'The Little Mermaid', to discuss the role of phraseological translation techniques, and to present possible translation inconsistencies. A theoretical introduction presents definitions for crucial terms. It is followed by the analysis of the corpus of phraseological units in Disney’s The Little Mermaid and...
-
Agile Commerce in the light of Text Mining
PublikacjaThe survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...
-
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
PublikacjaWe propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublikacjaThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Unités phraséologiques au pays de la traduction: transfert des collocations nomino-adjectivales avec le lexème «femme» dans la traduction de la littérature houellebecquienne du français vers l’italien et le polonais
PublikacjaThe present paper examines the transfer of nomino-adjectival collocations based on the word ‘femme’ (‘woman’) in the literary translation from French into Italian and Polish. The lexical connection analysed in the article can be defined as the habitual juxtaposition of a word with another word (or words) that has a significant frequency in a given language. The research corpus comprises seven Michel Houellebecq’s novels written...
-
Contextual ontology for tonality assessment
Publikacjaclassification tasks. The discussion focuses on two important research hypotheses: (1) whether it is possible to construct such an ontology from a corpus of textual document, and (2) whether it is possible and beneficial to use inferencing from this ontology to support the process of sentiment classification. To support the first hypothesis we present a method of extraction of hierarchy of contexts from a set of textual documents...
-
Constructing a Dataset of Speech Recordingswith Lombard Effect
PublikacjaThepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...
-
S’attaquer à la suprématie du masculin sur le féminin : le français inclusif dans les publications des universités françaises dans les réseaux sociaux
PublikacjaThis paper aims to examine the use of inclusive French in the Internet publications of Paris universities on their social media. Three higher education institutions were selected: Paris Dauphine-PSL University, Gustave Eiffel University, and Sorbonne Paris North University. The publications were obtained from Facebook, Instagram, and LinkedIn. Firstly, the groups of people to whom the use of inclusive French referred...
-
English, French, and Polish Aliases of Criminals: Diversity of Inspirations in their Creation and Typical Nicknaming Schemes
PublikacjaThe present paper examines the topic of aliases of criminals, which seems to be understudied in linguistic research. Therefore, this article’s primary goal is to describe how criminals’ aliases are created and what are the differences and similarities in that process in English, French, and Polish. Firstly, the theoretical background concerning the topic of pseudonyms is presented. Then, the corpus gathered for this paper (available...
-
A comparative study of English viseme recognition methods and algorithms
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...
-
A comparative study of English viseme recognition methods and algorithm
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaThe study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
Sésame, ouvre-toi: internationalisme phraséologique à contenu universel
PublikacjaPhraseological units, characterised by their opaque meaning, are the subject of multiple theoretical works. The following article adds to this discussion by providing another interesting example. It analyses the case of the Arabic phraseological unit ‘open sesame’ from the “Ali Baba and the Forty Thievesˮ folk tale, permeating into French, Italian, Polish, Turkish and Japanese – languages distant both linguistically and culturally....
-
Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
PublikacjaThis paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...
-
Reaktywny system oddziaływania ze środowiskiem oparty na inteligentnym systemie decyzyjnym
PublikacjaProcesy poznawcze zachodzące w umyśle człowieka, po matematycznym zamodelowaniu i algorytmizacji, mogą by wykorzystane do konstruowania inteligentnych systemów decyzyjnych. Systemy takie mają wielorakie zastosowania. Znaleźć można je między innymi w rozmaitych autonomicznych systemach informatyki, automatyki i robotyki: począwszy od 'inteligentnego' strażnika, kamerdynera, itp., a skończywszy na opiekunie - wirtualnym towarzyszu...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaMuch attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation...
-
Audio Feature Analysis for Precise Vocalic Segments Classification in English
PublikacjaAn approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...
-
Glossary [Intellectual Output 1] Glossary as a method for reflection on complex research questions
PublikacjaGlobalization and digitization are strongly influencing the process of shaping the built environment. The latter is causing the new design tools to emerge faster than ever before in history, while the former is speeding up not only the development, but also the broad roll-out of more agile and interdisciplinary methodologies and work approaches. The design process is also becoming more and more inter- and trans-disciplinary. This...