A Study on Named Entity Recognition with OpenNLP at English Texts

Authors

  • Metin Bilgin Bursa UludaÄŸ University

DOI:

https://doi.org/10.33633/jais.v4i1.2096

Abstract

Named entity recognition is a subject, inside of information retrieval which is a subdomain of natural processing. It pertains to identifying and labeling of location, person, organization, etc., inside of text content. Named entity recognition provides identifying and classifying of person, area, etc. inside of formal and informal text content and it can be used for different purposes as question answering systems and removal of the relation between events. In this work, named entity recognition is performed and one method is suggested and results are discussed for assignment to unlabeled name entities by using OpenNLP library with the help of KNIME program in the data set.

References

Nadeau, David, and Satoshi Sekine. "A survey of named entity recognition and classification." Lingvisticae Investigationes 30.1 (2007): 3-26.

https://imanage.com/blog/named-entity-recognition-ravn-part-1 - Online : 15.02.2018.

Eken, Beyza. Named entity recognition on Turkish short texts. Master Diss.Graduate School of Natural And Applied Sciences, 2015.

Rau, L.F., “Extracting Company Names from Textâ€, In Proc. Conference on Artificial Intelligence Applications of IEEE, 1991.

Chiu, Jason PC, and Eric Nichols. "Named entity recognition with bidirectional LSTM-CNNs." Transactions of the Association for Computational Linguistics 4 (2016): 357-370.

Derczynski, Leon, et al. "Analysis of named entity recognition and linking for tweets." Information Processing & Management51.2 (2015): 32-49.

Santos, Cicero Nogueira dos, and Victor Guimaraes. "Boosting named entity recognition with neural character embeddings." arXiv preprint arXiv:1505.05008 (2015).

Marrero, Mónica, et al. "Named entity recognition: fallacies, challenges and opportunities." Computer Standards & Interfaces35.5 (2013): 482-489.

Shaalan, Khaled. "A survey of arabic named entity recognition and classification." Computational Linguistics 40.2 (2014): 469-510.

Bikel, D.M., Miller, S., Schwartz, R., Weischedel, R., “A High-Performance Learning Name-finderâ€, In Proc. Conference on Applied Natural Language Processing, 1997.

Sekine, S., “Description of the Japanese NE System Used For Met-2â€, In Proc.Message Understanding Conference, 1998.

Borthwick, A., Sterling, J., Agichtein, E., Grishman, R., “Description of the MENE Named Entity System as used in MUC-7â€, In Proc. Seventh Message Understanding Conference, 1998.

Asahara, M., Matsumoto, Y., “Japanese Named Entity Extraction with Redundant Morphological Analysisâ€, In Proc. Human Language Technology Conference – North American chapter of the Association for Computational Linguistics, 2003.

McCallum, A., Li, W., “Early Results for Named Entity Recognition with Conditional Random Fields, Features Induction and Web-Enhanced Lexiconsâ€, In Proc. Conference on Computational Natural Language Learning, 2003.

Minkov, E., Wang, R., Cohen, W., â€Extracting Personal Names form e-mail: Applying Name Entity Recognition to Informal Textâ€, In Proc. Human Language Technology and Conference on Empirical Method in Natural Language Processing, 2005.

Brin, S., “Extracting Patterns and Relations from the World Wide Webâ€, In Proc.Conference of Extending Database Technology. Workshop on the Web and Databases, 1998.

Alfonseca, E., Manandhar, S., “An Unsupervised Method for General Named Entity Recognition and Automated Concept Discoveryâ€, In Proc. International Conference on General WordNet, 2002.

Cucerzan, S., Yarowsky, D., “Language independent named entity recognition combining morphological and contextual evidenceâ€, In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999.

https://www.kaggle.com/steelrat11/ner-project/data - Online 15.01.2018

Khanna, S. and Agarwal, S., “An Integrated Approach towards the prediction of Likelihood of Diabetes,†Proc. Machine Intelligence and Research Advancement, Katra, India, pp. 294-298, 2013.

Mahardhika, Y. M., Sudarsono, A. and Barakbah, A. R., “An implementation of Botnet dataset to predict accuracy based on network flow model,†Proc. Knowledge Creation and Intelligent Computing, Surabaya, Indonesia, pp. 33-39, 2017.

Raut, M. Y. and Barve, S. S., “A semi-automated review classification system based on supervised machine learning,†Proc. 1st. Intelligent Systems and Information Management, Aurangabad, India, pp. 127-133, 2017.

Downloads

Published

2019-07-16