A
A
Alexander2015-12-22 09:31:42
Algorithms
Alexander, 2015-12-22 09:31:42

How to extract entity data from text?

Given: a text with data about the entity, for example, an ad - "I will sell a used Ariston HotPoint v32.01 washing machine, excellent condition. Call 79871233445, Alexey. 7000 rubles."
It is necessary to extract data from the text, such as: name, model, state, contact and price.
What algorithms, libraries, frameworks, technologies can be used for this?

Answer the question

In order to leave comments, you need to log in

1 answer(s)
R
Roman Mirilaczvili, 2015-12-22
@2ord

I think the following text parsing tools are needed:

  1. Linguistic Analysis and Text Processing , Advanced Course in Natural Language Processing
  2. parser generator for natural language text
  3. compiling a dictionary of attributes: proper names, currency, cities. Based on the given classes of text attributes, find out the order for parsing the text. For example, in a given sentence, the presence of an action is determined (" buy " / "sell" / "exchange"), then " state ", contact (" call " and multi-digit number " 79871233445 ", local format), name " Alexey ", numbers " 7000 " + currency " rubles ". The name is the most difficult to define, and therefore it may make sense to define this attribute last.

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question