How to build basic frequency lists using stop lists?
Hello. I'll say it up front: I have an assignment whose wording I don't understand:
Take texts on two similar topics (movies and TV series).
1) Build basic frequency lists using a standard stop list.
2) Look at the results, adjust the stop lists.
3) Build frequency lists again, compare results.
I'm not asking for a solution, I'm asking you to explain what the assignment requires. What does "frequency list" mean? I have a stop list that contains words. Is a frequency list a list that shows how many times the stop words occur in a given text?
And what does it mean to "adjust" the stop lists? Add words that were not in them before?
The questions may be stupid, but I did not quite understand what is required of me.
A frequency dictionary (or frequency list) is a set of words in a given language (or sublanguage) along with information about their frequency of occurrence.
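To make the definition concrete, here is a minimal sketch of a frequency list for one short text, with no stop list applied yet. The sample sentence, the `frequency_list` helper, and the simple regex tokenizer (lowercase English letters and apostrophes only) are assumptions made for illustration, not part of the assignment.

```python
import re
from collections import Counter


def frequency_list(text):
    """Return (word, count) pairs, most frequent first."""
    # Assumed tokenizer: lowercase the text and keep runs of letters/apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common()


# Made-up sample text, for illustration only.
sample = "The film was good and the cast was good"
freq = frequency_list(sample)

for word, count in freq:
    print(word, count)
```

Note that function words such as "the" and "was" share the top of the list with "good", which is the reason a stop list is applied before counting.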
Most likely you need to remove all stop words from the texts and then run a frequency analysis on what remains. If the resulting frequency list still contains a lot of unneeded words (interjections, prepositions, ...), then the stop list does not cover them, and they should be added to it.
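One possible way to walk through the three steps of the assignment is sketched below. Everything specific here is an assumption: the two sample texts, the tiny `stop_words` set standing in for a "standard" stop list, the `frequency_list` helper, and the regex tokenizer. A real run would load the actual texts and a full stop list instead.

```python
import re
from collections import Counter


def frequency_list(text, stop_words):
    """Count every word of `text` that is not in `stop_words`."""
    # Assumed tokenizer: lowercase the text and keep runs of letters/apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w not in stop_words)


# Made-up sample texts, for illustration only.
movies = "Well, the movie was great and the ending of the movie was, well, a surprise"
series = "Oh, the series was long but the finale of the series was great"

# Step 1: build frequency lists with a (hypothetical, very small) standard stop list.
stop_words = {"the", "and", "of", "a", "was", "but"}
first_movies = frequency_list(movies, stop_words)
first_series = frequency_list(series, stop_words)
print(first_movies.most_common(5))
print(first_series.most_common(5))

# Step 2: the interjections "well" and "oh" slipped through, so add them.
adjusted = stop_words | {"well", "oh"}

# Step 3: rebuild the lists and compare the results.
second_movies = frequency_list(movies, adjusted)
second_series = frequency_list(series, adjusted)
shared = set(second_movies) & set(second_series)
print(second_movies.most_common(5))
print(second_series.most_common(5))
print("words in both lists:", shared)
```

Which words count as "unneeded" in step 2 is a judgment call made by reading the first lists, so the adjusted stop list will differ from corpus to corpus.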