Abstract
Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of the proposed solution and the tools used. Finally some conclusions are given.
Citations
-
2
CrossRef
-
0
Web of Science
-
3
Scopus
Authors (2)
Cite as
Full text
download paper
downloaded 259 times
- Publication version
- Accepted or Published Version
- License
- Copyright (2017 European Union)
Keywords
Details
- Category:
- Conference activity
- Type:
- materiały konferencyjne indeksowane w Web of Science
- Title of issue:
- 2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA) strony 13 - 17
- Language:
- English
- Publication year:
- 2017
- Bibliographic description:
- Boiński T. M., Chojnowski A..: Towards facts extraction from text in Polish language, W: 2017 IEEE International Conference on INnovations in Intelligent SysTems and Applications (INISTA), 2017, ,.
- DOI:
- Digital Object Identifier (open in new tab) 10.1109/inista.2017.8001124
- Bibliography: test
-
- C. D. Manning, P. Raghavan, H. Schütze et al., Introduction to infor- mation retrieval. Cambridge university press Cambridge, 2008, vol. 1, no. 1.
- M. Bates, "The theory and practice of augmented transition net- work grammars," in Natural language communication with computers. Springer, 1978, pp. 191-254. open in new tab
- C. D. Manning, H. Schütze et al., Foundations of statistical natural language processing. MIT Press, 1999, vol. 999.
- F. Zanettin, Translation-driven corpora: Corpus resources for descrip- tive and applied translation studies. Routledge, 2014. open in new tab
- J. Szymański, "Words context analysis for improvement of information retrieval," in International Conference on Computational Collective Intelligence. Springer, 2012, pp. 318-325. open in new tab
- W. Duch, J. Szymański, and T. Sarnatowicz, "Concept description vectors and the 20 question game," in Intelligent Information Processing and Web Mining. Springer, 2005, pp. 41-50. open in new tab
- J. Szymański, "Self-organizing map representation for clustering wikipedia search results," in Asian Conference on Intelligent Information and Database Systems. Springer, 2011, pp. 140-149. open in new tab
- A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. Hruschka Jr, and T. M. Mitchell, "Toward an architecture for never-ending language learning." in AAAI, vol. 5, 2010, p. 3.
- A. Gangemi, V. Presutti, D. Reforgiato Recupero, A. G. Nuzzolese, F. Draicchio, and M. Mongiovì, "Semantic web machine reading with fred," Semantic Web, no. Preprint, pp. 1-21, 2016. open in new tab
- A. Gangemi, V. Presutti, F. Draicchio, A. Musetti, and A. Nuz- zolese, "TIPALO," http://wit.istc.cnr.it/stlab-tools/tipalo, 2012, [Online: 10.05.2017].
- M. Bańko, "Word order in sentence, (in Polish)," http://sjp.pwn. pl/poradnia/haslo/szyk-wyrazow-w-zdaniu;14378.html, 2013, [Online: 10.05.2017]. open in new tab
- --, "Adjective order in sentence, (in Polish)," http://sjp.pwn. pl/poradnia/haslo/szyk-przymiotnikow;12630.html, 2013, [Online: 10.05.2017]. open in new tab
- D. Feng, Factorizing information extraction from text corpora. Pro- Quest, 2007.
- A. Savary, M. Ogrodniczuk, M. Zawislawska, K. Glowinska, and M. Kopec, Coreference: Annotation, Resolution and Evaluation in Polish. Walter de Gruyter GmbH & Co KG, 2015. open in new tab
- D. A. Ferrucci, "Ibm's watson/deepqa," in ACM SIGARCH Computer Architecture News, vol. 39, no. 3. ACM, 2011. open in new tab
- D. Ferrucci, A. Levas, S. Bagchi, D. Gondek, and E. T. Mueller, "Watson: beyond jeopardy!" Artificial Intelligence, vol. 199, pp. 93- 105, 2013. open in new tab
- M. Woliński, "Komputerowa weryfikacja gramatykiświdzińskiego," Ph.D. dissertation, Instytut Podstaw Informatyki PAN, Warszawa, 2004.
- M.Świdziński, Gramatyka formalna języka polskiego. Wydawn.
- Uniwersytetu Warszawskiego, 1992, no. 349.
- M. Piasecki, "Polish tagger takipi: Rule based construction and optimi- sation," Task Quarterly, vol. 11, no. 1-2, pp. 151-167, 2007. open in new tab
- Z. Saloni, W. Gruszczyński, M. Woliński, R. Wołosz, and D. Skowrońska, "Morfeusz, (in Polish)," http://sgjp.pl/morfeusz/ index.html, 2013, [Online: 10.05.2017].
- D. Czerski, K. Ciesielski, M. Dramiński, M. Kłopotek, P. Łoziński, and S. Wierzchoń, "What nekst?âȂŤsemantic search engine for polish internet," in Challenging Problems and Solutions in Intelligent Systems. Springer, 2016, pp. 335-347. open in new tab
- M. Ogrodniczuk and M. Lenart, "Web service integration platform for polish linguistic resources." in LREC, 2012, pp. 1164-1168.
- J. Waszczuk, "Harnessing the crf complexity with domain-specific constraints. the case of morphosyntactic tagging of a highly inflected language." in COLING, 2012, pp. 2789-2804.
- A. Buczyński and A. Przepiórkowski, "Spejd: A shallow processing and morphological disambiguation tool," in Language and Technology Conference. Springer, 2007, pp. 131-141. open in new tab
- J. Waszczuk, K. Głowińska, A. Savary, A. Przepiórkowski, and M. Lenart, "Annotation tools for syntax and named entities in the national corpus of polish," International Journal of Data Mining, Mod- elling and Management, vol. 5, no. 2, pp. 103-122, 2013. open in new tab
- M. Kopec and J. Kazimierza, "Zero subject detection for polish." in EACL, 2014, pp. 221-225. open in new tab
- A. Wróblewska and A. Przepiórkowski, "Projection-based annotation of a Polish dependency treebank," in Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, and S. Piperidis, Eds. Reykjavík, Iceland: ELRA, 2014, pp. 2306-2312. [Online]. Available: http://www.lrec-conf.org/proceedings/lrec2014/index.html
- E. Rudnicka, W. Witkowski, M. Kaliński et al., "Towards the method- ology for extending princeton wordnet," Cognitive Studies| Études cognitives, no. 15, pp. 335-351, 2015. open in new tab
- Wikipedia, "Polyp, (in Polish)," 2017, [Online: 10.05.2017].
- --, "Cnidoblasts, (in Polish)," 2017, [Online: 10.05.2017].
- --, "Box jellyfish, (in Polish)," 2017, [Online: 10.05.2017]. [Online]. open in new tab
- --, "Medusa, (in Polish)," 2017, [Online: 10.05.2017].
- --, "Anemones, (in Polish)," 2017, [Online: 10.05.2017].
- --, "Wojciech Jaruzelski, (in Polish)," 2017, [Online: 10.05.2017]. [Online]. Available: \url{https://pl.wikipedia.org/wiki/ Wojciech_Jaruzelski}
- --, "Lech Wałȩsa, (in Polish)," 2017, [Online: 10.05.2017]. open in new tab
- --, "Aleksander Kwaśniewski, (in Polish)," 2017, [Online: 10.05.2017]. [Online]. Available: \url{https://pl.wikipedia.org/wiki/ Aleksander_Kwa%C5%9Bniewski} open in new tab
- --, "Lech Kaczyński, (in Polish)," 2017, [Online: 10.05.2017]. open in new tab
- [Online]. Available: \url{https://pl.wikipedia.org/wiki/Lech_Kaczy% C5%84ski} [38] --, "Bronisław Komorowski, (in Polish)," 2017, [Online: 10.05.2017]. [Online]. Available: \url{https://pl.wikipedia.org/wiki/ Bronis%C5%82aw_Komorowski} open in new tab
- --, "Andrzej Duda, (in Polish)," 2017, [Online: 10.05.2017]. open in new tab
- T. Boiński, "Game with a purpose for mappings verification," in Computer Science and Information Systems (FedCSIS), 2016 Federated Conference on. IEEE, 2016, pp. 405-409. open in new tab
- Verified by:
- Gdańsk University of Technology
seen 148 times
Recommended for you
Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain
- D. Dobrowolski,
- P. Kapłański,
- A. Marciniak
- + 1 authors
2015