R
R
Rozello2018-03-10 15:47:59
Machine learning
Rozello, 2018-03-10 15:47:59

How to parse weakly formalized text?

There is a need to parse this list of drugs with descriptions and a list of drug synonyms.
As a result, I would like to see a json file with a code like this :
5aa3d3178b35b205945860.png
Using dom, css selectors and regular expressions, this cannot be done (more precisely, it does, but only partially), because you have to write rules for a very large number of exceptions in the form of typos, nested brackets, nested lists with different string formatting, etc.

Answer the question

In order to leave comments, you need to log in

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question