Named Entity Recognition by Neural Prediction - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Named Entity Recognition by Neural Prediction

Nouha Hammadi
  • Fonction : Auteur

Résumé

Named entity recognition (NER) remains a very challenging problem essentially when the document, where we perform it, is handwritten and ancient. Traditional methods using regular expressions or those based on syntactic rules, work but are not generic because they require, for each dataset, additional work of adaptation. We propose here a recognition method by context exploitation and tag prediction. We use a pipeline model composed of two consecutive BLSTMs (Bidirectional Long-Short Term Memory). The first one is a BLSTM-CTC coupling to recognize the words in a text line using a sliding window and HOG features. The second BLSTM serves as a language model. It cleverly exploits the gates of the BLSTM memory cell by deploying some syntactic rules in order to store the content around the proper nouns. This operation allows it to predict the tag of the next word, depending on its context, which is followed gradually until the discovery of the named entity (NE). All the words of the context are used to help the prediction. We have tested this system on a private dataset of Philharmonie de Paris, for the extraction of proper nouns within sale music transactions as well as on the public IAM dataset. The results are satisfactory, compared to what exists in the literature.
Fichier principal
Vignette du fichier
Nouha-paper.pdf (10.18 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01981613 , version 1 (22-10-2020)

Identifiants

  • HAL Id : hal-01981613 , version 1

Citer

Nouha Hammadi, Abdel Belaïd, Yolande Belaïd. Named Entity Recognition by Neural Prediction. International Conference on Image Processing, Computer Vision, & Pattern Recognition, Jul 2018, Las Vegas, United States. ⟨hal-01981613⟩
64 Consultations
50 Téléchargements

Partager

Gmail Facebook X LinkedIn More