Search results for: TEXT PROCESSING
-
Text Technology: A Journal of computer Text Processing
Journals -
Towards Effective Processing of Large Text Collections
PublicationIn the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...
-
International Conference on Intelligent Text Processing and Computational Linguistics
Conferences -
Time-domain prosodic modifications for text-to-speech synthesizer
PublicationAn application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
-
Semantic Analysis and Text Summarization in Socio-Technical Systems
PublicationIn this chapter the authors present the results of the development the methodology for increasing the reliability of the functioning of the Socio-Technical System. The existed methods and algorithms for processing unstructured (textual) information were studied. Taking into account noted above strengths and weaknesses of Discriminant and Probabilistic approaches of Latent Semantic Relations analysis in of the summarization projection...
-
Comparative Analysis of Text Representation Methods Using Classification
PublicationIn our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article—evaluation of approaches to text representation for machine learning tasks—indicates that the text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
PublicationIn this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Developing Methods for Building Intelligent Systems of Information Resources Processing Using an Ontological Approach
PublicationThe problem of developing methods of information resource processing is investigated. A formal procedure description of processing text content is developed. A new ontological approach to the implementation of business processes is proposed. Consider that the aim of our work is to develop methods and tools for building intelligent systems of information resource processing, the core of knowledge bases of which are ontology’s, and...
-
Influence of YARN Schedulers on Power Consumption and Processing Time for Various Big Data Benchmarks
PublicationClimate change caused by human activities can influence the lives of everybody onthe planet. The environmental concerns must be taken into consideration by all fields of studyincludingICT. Green Computing aims to reduce negative effects of IT on the environment while,at the same time, maintaining all of the possible benefits it provides. Several Big Data platformslike Apache Spark orYARNhave become widely used in analytics and...
-
Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing
PublicationPurpose This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublicationThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
Previous Opinions is All You Need - Legal Information Retrieval System
PublicationWe present a system for retrieving the most relevant legal opinions to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As a training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...
-
Blockchain based Secure Data Exchange between Cloud Networks and Smart Hand-held Devices for use in Smart Cities
PublicationIn relation to smart city planning and management, processing huge amounts of generated data and execution of non-lightweight cryptographic algorithms on resource constraint devices at disposal, is the primary focus of researchers today. To enable secure exchange of data between cloud networks and mobile devices, in particular smart hand held devices, this paper presents Blockchain based approach that disperses a public/free key...
-
Radar with rotary head
PublicationNowadays usage of radars is no longer reserved only for the military purpose. It finds many applications in various areas of science and industry. It may be used in order to obtain extended information about the state of critical infrastructure, like shipyards or petrochemical plants. Furthermore, it may be applied in vision denied environments. The aim of this project...
-
Tomasz Maria Boiński dr inż.
PeopleI’m associated with the University since the year 2000 when I started my studies in Computer Science on the Faculty of Electronics, Telecommunications and Informatics. After graduating with honors in 2005, I applied for doctoral studies. During his studies and immediately afterward I was involved in cooperation with Hogart from Warsaw, in the implementation of business solutions in Gdynia company Elektronika SA (Infor FMS SunSystems)...
-
SEMANTIC ANALYSIS ALGORITHMS FOR KNOWLEDGE WORKERS SUPPORT
PublicationThe paper examines various aspects of text analysis application for knowledge worker’s activity realization. Conclusions are drawn about the relevance and importance of processing the non-structured textual information in order to increase knowledge worker’s efficiency, as well as their awareness in different branches of science. The paper considers the existing algorithms of texts semantic analysis as the sphere of documents topical...
-
Towards a Framework for Context Awareness Based on Textual Process Data
PublicationContext awareness is critical for the successful execution of processes. In the abundance of business process management (BPM) research, frameworks exclusively devoted to extracting context from textual process data are scarce. With the deluge of textual data and its increasing value for organizations, it be-comes essential to employ relevant text analytics techniques to increase the awareness of business process (BP) workers,...
-
Intelligent information services 23/24
e-Learning CoursesInformation retrieval Text categorization Natural language processing
-
Business Sentiment Analysis. Concept and Method for Perceived Anticipated Effort Identification
PublicationRepresenting a valuable human-computer interaction interface, Sentiment Analysis (SA) is applied to a wide range of problems. In the present paper, the researchers introduce a novel concept of Business Sentiment (BS) as a measurement of a Perceived Anticipated Effort (PAE) in the context of business processes (BPs). BS is considered as an emotional component of BP task contextual complexity perceived by a process worker after reading...
-
Evaluation of a company’s image on social media using the Net Sentiment Rate
PublicationVast amounts of new types of data are constantly being created as a result of dynamic digitization in all areas of our lives. One of the most important and valuable categories for business is data from social networks such as Facebook. Feedback resulting from the sharing of thoughts and emotions, expressed in comments on various products and services, is becoming the key factor on which modern business is based. This feedback is...
-
On the Possibilities of Decreasing Power Loss in Large Tilting Pad Thrust Bearings
PublicationJournal Menu About this Journal Abstracting and Indexing Aims and Scope Article Processing Charges Articles in Press Author Guidelines Bibliographic Information Citations to this Journal Contact Information Editorial Board Editorial Workflow Free eTOC Alerts Publication Ethics Submit a Manuscript Table of Contents Abstract Full-Text PDF Full-Text HTML...
-
Relation-based Wikipedia Search System for Factoid Questions Answering
PublicationIn this paper we propose an alternative keyword search mechanism for Wikipedia, designed as a prototype solution towards factoid questions answering. The method considers relations between articles for finding the best matching article. Unlike the standard Wikipedia search engine and also Google engine, which search the articles content independently, requiring the entire query to be satisfied by a single article, the proposed...
-
Greenhouse control system design
PublicationCoraz większa populacja i zmniejszające się tereny uprawne wymusza efektywniejsze metody uprawy roślin. Zaradzić temu mogą układu hydroponiczne, które dzięki rozwojowi techniki są w stanie osiągać znacznie większe oraz bardziej jednorodne plony. Jest to możliwe dzięki zaawansowanym systemom opartym na dokładnych urządzeniach pomiarowych, sterowaniu w zamkniętej pętli oraz mikrokontrolerom umożliwiającym...
-
Towards facts extraction from text in Polish language
PublicationNatural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...
-
Borders of Digital Art in the Context of the Information Society
PublicationThe article shows the relationship between the development of information technologies, the characteristics of the information society and digital art. The broad possibilities of the digital world related to recording, storing and processing data (cyber text, big data, smart services) and the creation of virtual worlds are pointed out. The influence of the development of information technologies on the character of the works of...
-
Can augmented democracy fulfil the ideal of deliberative democracy?
PublicationNew technologies can broaden the scope of human limits. We can fly to the moon thanks to rocket science, we can communicate with each other by mobile phones and WiFi and we can exceed human life with medical equipment like pacemakers. But there are aspects of our life that still are very distrustful of implementing new technology into functioning. An example of such a field of our everyday life is...
-
SIGIR workshop: Stylistic Analysis of Text For Information Access
Conferences -
Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms
PublicationNowadays, in the age of Internet, access to open data detects the huge possibilities for information retrieval. More and more often we hear about the concept of open data which is unrestricted access, in addition to reuse and analysis by external institutions, organizations and people. It’s such information that can be freely processed, add another data (so-called remix) and then published. More and more data are available in text...
-
The image of the City on social media: A comparative study using “Big Data” and “Small Data” methods in the Tri-City Region in Poland
Publication“The Image of the City” by Kevin Lynch is a landmark planning theory of lasting influence; its scientific rigor and relevance in the digital age were in dispute. The rise of social media and other digital technologies offers new opportunities to study the perception of urban environments. Questions remain as to whether social media analytics can provide a reliable measure of perceived city images? If yes, what implication does...
-
Methodology of Selecting the Hadoop Ecosystem Configuration in Order to Improve the Performance of a Plagiarism Detection System
PublicationThe plagiarism detection problem involves finding patterns in unstructured text documents. Similarity of documents in this approach means that the documents contain some identical phrases with defined minimal length. The typical methods used to find similar documents in dig- ital libraries are not suitable for this task (plagiarism detection) because found documents may contain similar content and we have not any war- ranty that...
-
Zaawansowane Przetwarzanie Sygnałów - angielski - Nowy
e-Learning CoursesLecture presents development of advanced signal processing techniques. Seminar, after the lectures, considers selected issues/subjects of signal processing techniques presented by individual students.
-
L1 Cell Adhesion Molecule Overexpression Down Regulates Phosphacan and Up Regulates Structural Plasticity-Related Genes Rostral and Caudal to the Complete Spinal Cord Transection
PublicationL1 cell adhesion molecule (L1CAM) supports spinal cord cellular milieu after contusion and compression lesions, contributing to neuroprotection, promoting axonal outgrowth, and reducing outgrowth-inhibitory molecules in lesion proximity. We extended investigations into L1CAM molecular targets and explored long-distance effects of L1CAM rostral and caudal to complete spinal cord transection (SCT) in...
-
Retrieval with Semantic Sieve
PublicationThe article presents an algorithm we called Semantic Sieve applied for refining search results in text documents repository. The algorithm calculates socalled conceptual directions that enables interaction with the user and allows to narrow the set of results to the most relevant ones. We present the system where the algorithm has been implemented. The system also offers in the presentation layer clustering of the results into...
-
Generating actionable evidence from free-text feedback to improve maternity and acute hospital experiences: A computational text analytics & predictive modelling approach
PublicationBackground Patient experience surveys are a key source of evidence for supporting decision-making and quality improvement in healthcare services. These surveys contain two main types of questions: closed and open-ended, asking about patients’ care experiences. Apart from the knowledge obtained from analysing closed-ended questions, invaluable insights can be gleaned from free-text data. Advanced analytics techniques are increasingly...
-
Development and Research of the Text Messages Semantic Clustering Methodology
PublicationThe methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...
-
Image Processing in Robotics (2021/2022)
e-Learning CoursesFor ISD M.Sc. (II degr.) 2 sem. Participants are to learn image processing algorithms related to transformation, filtration, feature detection (image descriptors), image processing algorithms in robotic industrial systems.
-
Environmental Protection in Energetics, PG_00049751, W, PE-ET, sem.1, winter 2023/24
e-Learning CoursesEnvironmental aspects of energy production and processing.
-
Environmental Protection in Energetics (PG_00049751), W, PE-ET, sem.1, winter 2023/24
e-Learning CoursesEnvironmental aspects of energy production and processing.
-
Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network
PublicationTo effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such as document classification, document summarization, and so forth. In our work, we study the quality of text representations using statistical methods and compare them to approaches...
-
Prioritising national healthcare service issues from free text feedback – A computational text analysis & predictive modelling approach
PublicationPatient experience surveys have become a key source of evidence for supporting decision-making and continuous quality improvement within healthcare services. To harness free-text feedback collected as part of these surveys for additional insights, text analytics methods are increasingly employed when the data collected is not amenable to traditional qualitative analysis due to volume. However, while text analytics techniques offer...
-
A survey of automatic speech recognition deep models performance for Polish medical terms
PublicationAmong the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
What matters most to patients? On the Core Determinants of Patient Experience from Free Text Feedback
PublicationFree-text feedback from patients is increasingly used for improving the quality of healthcare services and systems. A major reason for the growing interest in harnessing free-text feedback is the belief that it provides richer information about what patients want and care about. The use of computational approaches such as structural topic modelling for analysing large unstructured textual data such as free-text feedback from patients...
-
Wyspiański Pavilion
PublicationText on Wyspiański Pavilion in Cracow.
-
Gdańsk Shakespeare Theatre
PublicationText on Gdańsk Shakespeare Theatre.
-
Review on Wikification methods
PublicationThe paper reviews methods on automatic annotation of texts with Wikipedia entries. The process, called Wikification aims at building references between concepts identified in the text and Wikipedia articles. Wikification finds many applications, especially in text representation, where it enables one to capture the semantic similarity of the documents. Also, it can be considered as automatic tagging of the text. We describe typical...
-
Rozliczalność władzy politycznej jako element wzmocnienia demokracji i podwyższenia jej jakości: przykład Polski
PublicationCelem artykułu jest wskazanie odpowiednich,...
-
European Solidarity Centre
PublicationText on European Solidarity Centre in Gdansk.
-
National Music Forum
PublicationText on National Music Forum in Wroclaw.
-
Environmental Protection in Energetics (PG_00049751), W, ET, sem.1, winter 2024/2025
e-Learning CoursesEnvironmental aspects of energy production and processing.
-
Agile Commerce in the light of Text Mining
PublicationThe survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...