Search results for: NATURAL LANGUAGE PROCESSING, LARGE LANGUAGE MODELS, DATA MINING, QUANTUM PHYSICS
-
Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
Open Research DataWe introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:
-
Experimental study on models of cylindrical steel tanks under mining tremors and moderate earthquakes
PublicationThe aim of the study is to show the results of complex shaking table experimental investigation focused on the response of two models of cylindrical steel tanks under mining tremors and moderate earthquakes, including the aspects of diagnosis of structural damage. Firstly, the impact and the sweep-sine tests have been carried out, so as to determine the dynamic properties of models filled with different levels of liquid. Then,...
-
SoundShape - Headphone Transfer Function database
Open Research DataThis publication introduces the SoundShape database, which contains closed-ear headphone transfer functions (HpTF) for fifteen headphone models. Several models included in this database are also found in other well-known databases, such as Virtuoso and Binaural Decoders. However, for some models found in the literature, HpTF filters were unavailable,...
-
Processing of Satellite Data in the Cloud
PublicationThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomenon, starting from atomic level reactions, through macroscopic...
-
Automated Valuation Model based on fuzzy and rough set theory for real estate market with insufficient source data
PublicationObjective monitoring of the real estate value is a requirement to maintain balance, increase security and minimize the risk of a crisis in the financial and economic sector of every country. The valuation of real estate is usually considered from two points of view, i.e. individual valuation and mass appraisal. It is commonly believed that Automated Valuation Models (AVM) should be devoted to mass appraisal, which requires a large...
-
Decision making techniques for electronic communication: an example for Turkey
PublicationCommunication is the way for people exchanging information with each other by using various tools. Electronic communication or Ecommunication is the process of sending, receiving and processing information or messages electronically. Electronic communication that is closely related to the development levels of countries, has made considerable progress especially in terms technology, innovation and entrepreneur. In this study, it...
-
[Chapter] 22. Application of physical modeling to study combustion process-es and flow patterns in large-scale boilers and furmaces. W: Optical me- thods and data processing in heat and fluid flow. Ed. C. Greated, J. Cos-grove, J.M. Buick. Bury St. Edmunds. London: Profess. Eng. Publ.**2002 s. 267-277, 4 rys. bibliogr. 6 poz. Zastosowanie modelowania fizycznego do badania procesów spalania pola prze- pływu w przemysłowych kotłach i piecach.
PublicationRozdział zawiera wyniki badań modelowania fizycznego dwuwymiarowego i trój-wymiarowego pola przepływu i procesów spalania w wybranych urządzeniachprzemysłowych.
-
Performance Analysis of the OpenCL Environment on Mobile Platforms
PublicationToday’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...
-
2023_Reinventing Gdansk_Elective seminar
e-Learning CoursesThe workshop will examine important modern architecturalbuildings in Gdansk in different political, cultural, economicand environmental contexts. The students will be dividedinto groups. Each group will be led by the curator of theexhibition. They will identify and evaluate the architecturalvalues of the building through analyses, deconstructions andsyntheses, focusing on understanding the basic principles ofarchitecture and establishing...
-
Iterative Global Sensitivity Analysis Algorithm with Neural Network Surrogate Modeling
PublicationGlobal sensitivity analysis (GSA) is a method to quantify the effect of the input parameters on outputs of physics-based systems. Performing GSA can be challenging due to the combined effect of the high computational cost of each individual physics-based model, a large number of input parameters, and the need to perform repetitive model evaluations. To reduce this cost, neural networks (NNs) are used to replace the expensive physics-based...
-
Swapping Space for Time: An Alternative to Time-Domain Interferometry
PublicationYoung's double-slit experiment [1] requires two waves produced simultaneously at two different points in space. In quantum mechanics the waves correspond to a single quantum object, even as complex as a big molecule. An interference is present as long as one cannot tell for sure which slit is chosen by the object. The more we know about the path, the worse the interference. In the paper we show that quantum mechanics allows for...
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Instructor Presence in Video Lectures: Preliminary Findings From an Online Experiment
PublicationMotivation. Despite the widespread use of video lectures in online and blended learning environments, there is still debate whether the presence of an instructor in the video helps or hinders learning. According to social agency theory, seeing the instructor makes learners believe that s/he is personally teaching them, which leads to deeper cognitive processing and, in turn, better learning outcomes. Conversely, according to cognitive...
-
Objectivity in the non-Markovian spin-boson model
PublicationObjectivity constitutes one of the main features of the macroscopic classical world. An important aspect of the quantum-to-classical transition issue is to explain how such a property arises from the microscopic quantum theory. Recently, within the framework of open quantum systems, there has been proposed such a mechanism in terms of the so-called spectrum broadcast structures. These are multipartite quantum states of the system...
-
Preliminary Citation and Topic Analysis of International Conference on Agile Software Development Papers (2002-2018)
PublicationThis study utilizes citation analysis and automated topic analysis of papers published in International Conference on Agile Software Development (XP) from 2002 to 2018. We collected data from Scopus database, finding 789 XP papers. We performed topic and trend analysis with R/RStudio utilizing the text mining approach, and used MS Excel for the quantitative analysis of the data. The results show that the first five years of XP...
-
Using FreeFEM open software for modelling the vibrations of piezoelectric devices
PublicationModelling vibrations of piezoelectric transducers has been a topic discussed in the literature for many decades. The first models - so-called one-dimensional - describe the vibrations only near operating frequency and near its harmonics. Attempts to introduce two-dimensional models were related to the possibility of one transducer working at several frequencies, including both thickness vibrations and those resulting from the transducer...
-
MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems
PublicationIn this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects...
-
Surrogate Modeling and Optimization Using Shape-Preserving Response Prediction: A Review
PublicationComputer simulation models are ubiquitous in modern engineering design. In many cases, they are the only way to evaluate a given design with sufficient fidelity. Unfortunately, an added computa-tional expense is associated with higher fidelity models. Moreover, the systems being considered are often highly nonlinear and may feature a large number of designable parameters. Therefore, it may be impractical to solve the design problem...
-
NLP Questions Answering Using DBpedia and YAGO
PublicationIn this paper, we present results of employing DBpedia and YAGO as lexical databases for answering questions formulated in the natural language. The proposed solution has been evaluated for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference). Our method uses dependency trees generated from the user query. The trees are browsed for paths leading from the root of the tree to the question...
-
Application of the finite element methods in long-term simulation of the multi-physics systems with large transient response differences
PublicationApplication of the Finite Element Method (FEM) and the Multibody Dynamics Method allows analyzing of complex physical systems. Complexity of the system could be related both to the geometry and the physical description of phenomenon. The metod is the excellent tool for analyzing statics or dynamics of the mechanical systems, and permits tracking of Multi Body System (MBS) transient response for the long-term simulations and application...
-
A Multi-Fidelity Surrogate-Model-Assisted Evolutionary Algorithm for Computationally Expensive Optimization Problems
PublicationIntegrating data-driven surrogate models and simulation models of different accuracies (or fideli-ties) in a single algorithm to address computationally expensive global optimization problems has recently attracted considerable attention. However, handling discrepancies between simulation models with multiple fidelities in global optimization is a major challenge. To address it, the two major contributions of this paper include:...
-
Binary-Encounter Model for Direct Ionization of Molecules by Positron-Impact
PublicationWe introduce two models for the computation of direct ionization cross sections by positron impact over a wide range of collision energies. The models are based on the binary-encounter-Bethe model and take into account an extension of the Wannier theory. The cross sections computed with these models show good agreement with experimental data. The extensions improve the agreement between theory and experiment for collision energies...
-
Increased Certification of Semi-device Independent Random Numbers using Many Inputs and More Postprocessing
PublicationQuantum communication with systems of dimension larger than two provides advantages in information processing tasks. Examples include higher rates of key distribution and random number generation. The main disadvantage of using such multi-dimensional quantum systems is the increased complexity of the experimental setup. Here, we analyze a not-so-obvious problem: the relation between randomness certification and computational requirements...
-
Big Data and the Internet of Things in Edge Computing for Smart City
PublicationRequests expressing collective human expectations and outcomes from city service tasks can be partially satisfied by processing Big Data provided to a city cloud via the Internet of Things. To improve the efficiency of the city clouds an edge computing has been introduced regarding Big Data mining. This intelligent and efficient distributed system can be developed for citizens that are supposed to be informed and educated by the...
-
Human verbal memory encoding is hierarchically distributed in a continuous processing stream
PublicationProcessing of memory is supported by coordinated activity in a network of sensory, association, and motor brain regions. It remains a major challenge to determine where memory is encoded for later retrieval. Here we used direct intracranial brain recordings from epilepsy patients performing free recall tasks to determine the temporal pattern and anatomical distribution of verbal memory encoding across the entire human cortex. High...
-
Simulation of parallel similarity measure computations for large data sets
PublicationThe paper presents our approach to implementation of similarity measure for big data analysis in a parallel environment. We describe the algorithm for parallelisation of the computations. We provide results from a real MPI application for computations of similarity measures as well as results achieved with our simulation software. The simulation environment allows us to model parallel systems of various sizes with various components...
-
Knowledge-Based Virtual Modeling and Simulation of Manufacturing Processes for Industry 4.0
PublicationABSTRACT Industry 4.0 aims at providing a digital representation of a production landscape, but the challenges in building, maintaining, optimizing, and evolving digital models in inter-organizational production chains have not been identified yet in a systematic manner. In this paper, various Industry 4.0 research and technical challenges are addressed, and their present scenario is discussed. Moreover, in this article, the novel...
-
Mural i jego rola w przestrzeni zurbanizowanej
PublicationCzym tak naprawdę jest mural? Definicji jest bardzo wiele. Według Słownika języka polskiego PWN mural to „wielkie malowidło wykonane bezpośrednio na ścianie budynku”. Pierwotnymi malowidłami tego typu były prace naskalne z epoki paleolitu. Następnie ważnymi epokami dla rozwoju tego typu prac był starożytny Egipt i starożytny Rzym. Samo słowo „mural” pochodzi z języka hiszpańskiego (h. mural – ścienny; malarstwo ścienne). To dzięki...
-
A distributed system for conducting chess games in parallel
PublicationThis paper proposes a distributed and scalable cloud based system designed to play chess games in parallel. Games can be played between chess engines alone or between clusters created by combined chess engines. The system has a built-in mechanism that compares engines, based on Elo ranking which finally presents the strength of each tested approach. If an approach needs more computational power, the design of the system allows...
-
Nieliniowa statyka 6-parametrowych powłok sprężysto plastycznych. Efektywne obliczenia MES
PublicationGłównym zagadnieniem omawianym w monografii jest sformułowanie sprężysto-plastycznego prawa konstytutywnego w nieliniowej 6-parametrowej teorii powłok. Wyróżnikiem tej teorii jest występujący w niej w naturalny sposób tzw. stopień 6 swobody, czyli owinięcie (drilling rotation). Podstawowe założenie pracy to przyjęcie płaskiego stanu naprężenia uogólnionego na ośrodek typu Cosseratów. Takie podejście stanowi oryginalny aspekt opracowania....
-
A simplified behavioral MOSFET model based on parameters extraction for circuit simulations.
PublicationThe paper presents results on behavior modeling of general purpose Metal-Oxide Semiconductor Field-Effect Transistor (MOSFET) for simulation of power electronics systems requiring accuracy both in steady-state and in switching conditions. Methods of parameters extraction including nonlinearity of parasitic capacitances and steady-state characteristics are based on manufacturer data sheet and externally measurable characteristics....
-
Analysis of results of large-scale multimodal biometric identity verification experiment
PublicationAn analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...
-
An Analysis of Neural Word Representations for Wikipedia Articles Classification
PublicationOne of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...
-
YADE - An extensible framework for the interactive simulation of multiscale, multiphase, and multiphysics particulate systems
PublicationThis contribution presents the key elements of YADE, an extensible open-source framework for dynamic simulations. During the past 19 years, YADE has evolved from "Yet Another Dynamic Engine"' to a versatile multiscale and multiphysics solver, counting a large, active, and growing community of users and developers. The computationally intense parts of the source code are written in C++, using flexible object models that allow for...
-
Using wavelet techniques for multibeam sonar bathymetry data compression
PublicationMultibeam sonars are widely used in applications like high resolution bathymetry measurements or underwater object imaging. One of the significant problems in multibeam sensing of the marine environment is large amount of data which must be transmitted from the sonar processing unit to an operator station using a limited bit rate channel. For instance, such a situation would be in the case when the multibeam sonar was mounted on...
-
On the compression of multibeam sonar raw bathymetry data
PublicationMultibeam sonars are widely used in applications like high resolution bathymetry measurements or underwater object imaging. One of the significant problems in multibeam sensing of the marine environment is large amount of data which must be transmitted from the sonar processing unit to an operator station using a limited bit rate channel. For instance, such a situation would be in the case when the multibeam sonar was mounted on...
-
Weak localization competes with the quantum oscillations in a natural electronic superlattice: The case of Na1.5(PO2)4(WO3)20
PublicationWe report an investigation of the combined structural and electronic properties of the bronze Na1.5(PO2)4(WO3)20. Its low-dimensional structure and possible large reconstruction of the Fermi surface due to charge density wave instability make this bulk material a natural superlattice with a reduced number of carriers and Fermi energy. Signatures of multilayered two-dimensional (2D) electron weak localization are consequently reported,...
-
Spin-Orbit Coupling Matrix Elements in the KRb Molecule
Open Research DataThe allowed 190 spin-orbit coupling (SOC) matrix elements have been calculated for the singlet (s) and triplet (t) Sigma+ (S+), Pi (P), and Delta (D) electronic states of the KRb molecule. These SOCs are needed for investigations of areas connected with classical spectroscopy, deperturbation analysis of the observed spectra, atom-molecule and molecule-molecule...
-
IP Core of Coprocessor for Multiple-Precision-Arithmetic Computations
PublicationIn this paper, we present an IP core of coprocessor supporting computations requiring integer multiple-precision arithmetic (MPA). Whilst standard 32/64-bit arithmetic is sufficient to solve many computing problems, there are still applications that require higher numerical precision. Hence, the purpose of the developed coprocessor is to support and offload central processing unit (CPU) in such computations. The developed digital...
-
Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders
PublicationAn experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their...
-
Incremental construction of Minimal Tree Automata [online]
PublicationWe describe an algorithm that allows the incremental addition or removal of unranked ordered trees to minimal frontier-to-root deterministic tree automaton (DTA). The algorithm takes a tree t and a minimal DTA A as input; it outputs a minimal DTA A' which accepts the language L(A) accepted by A incremented (or decremented) with the tree t. The algorithm can be used to efficiently maintain dictionaries which store large collections...
-
General concept of reduction process for big data obtained by interferometric methods
PublicationInterferometric sonar systems apply the phase content of the sonar signal to measure the angle of a wave front returned from the seafloor or from a target. It collect a big data – datasets that are so large or complex that traditional data processing application software is inadequate to deal with them. The recording a large number of data is associated with the difficulty of their efficient use. So data have to be reduced. The main...
-
Correction of far-field measurements obtained in non-anechoic test site
Open Research DataThe dataset contains raw and processed measurements of radiation pattern characteristics performed in non-anechoic regime for two geometrically small antenna structures: a spline-parameterized Vivaldi structure and a compact spline-based monopole. The responses have been obtained at the selected frequencies of interest as a function of mentioned structures...
-
Using Rule-Based System for Monitoring Marine Navigation Data Processing
PublicationProcessing marine navigational data requires sophisticated software solutions. Typically, specialized tools called processors are analyzing raw data from different sensors. It becomes important to create the monitoring software that is able to validate and verify processing components integrated into the final system. Drools®business rule management platform provides a core business rules engine, web authoring and rules management...
-
Process of Medical Dataset Construction for Machine Learning-Multifield Study and Guidelines
PublicationThe acquisition of high-quality data and annotations is essential for the training of efficient machine learning algorithms, while being an expensive and time-consuming process. Although the process of data processing and training and testing of machine learning models is well studied and considered in the literature, the actual procedures of obtaining data and their annotations in collaboration with physicians are in most cases...