Python
Margarita Lionova, 2022-03-28 04:10:59

How to scrape a product page?

I am a complete beginner at parsing; I know essentially nothing about it.

There is a link to a product on the Chinese website Taobao, for example: https://item.taobao.com/item.htm?spm=a1z10.3-cs.w...

From this page I need to parse the price, the photo, the colors, the sizes, the photos at the bottom, and in general all the labeled elements in this screenshot:

6241094c16835991780377.png

And put this data into a table so that it looks like this:

6241096bd2967459020169.jpeg

There are thousands of such product links, so help is needed. Apart from the word "parsing", I know nothing about this topic.
I am not asking you to write out how to do this step by step (although that would be nice). I am asking you to point me to something I can study that will help with this task: a course from some school, videos, articles, and so on. Perhaps there is a community that deals with this kind of thing where I can ask around. I will be grateful for any help. If you are willing to take this on as paid work, write to me on VK; the link is in my profile.


1 answer(s)
i3a4em, 2022-03-28
@margobuiss

1. You can get the raw data for parsing with the requests library. The result is the HTML source of the page.
2. You can extract data from the HTML using regular expressions.
In most cases, this should be enough for you.
Regular expressions are, of course, a difficult topic for a beginner. But once you have set out to write a parser, there is no getting around them.
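
The two steps above, plus writing the result into a table, can be sketched roughly as follows. This is only a minimal illustration: the function names, the sample HTML, and both regex patterns are made up for the example, since the real markup of the Taobao page is not shown in the thread. The patterns would have to be adapted to whatever the downloaded page source actually contains.

```python
import csv
import io
import re

# Hypothetical patterns: the real page markup is not shown in the thread,
# so these must be adapted to the actual HTML source.
PRICE_RE = re.compile(r'class="price"[^>]*>\s*([\d.]+)')
IMAGE_RE = re.compile(r'<img[^>]+src="([^"]+)"')


def fetch_html(url):
    """Step 1: download the raw HTML of a page with requests."""
    # Imported here so the parsing part can be tried without requests installed.
    import requests  # third-party: pip install requests

    response = requests.get(url, headers={"User-Agent": "Mozilla/5.0"}, timeout=10)
    response.raise_for_status()
    return response.text


def parse_product(html):
    """Step 2: pull fields out of the HTML with regular expressions."""
    price = PRICE_RE.search(html)
    return {
        "price": price.group(1) if price else "",
        "images": IMAGE_RE.findall(html),
    }


def to_csv(products):
    """Write parsed products as CSV text, one row per product."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["price", "images"])
    for product in products:
        writer.writerow([product["price"], " ".join(product["images"])])
    return buffer.getvalue()


# Made-up sample standing in for a downloaded page.
SAMPLE_HTML = '<span class="price">199.00</span><img src="https://example.com/a.jpg">'
product = parse_product(SAMPLE_HTML)
print(to_csv([product]))
```

For real links, the idea would be to call fetch_html(url) for each one, pass the result to parse_product, and save the to_csv output to a file that opens in a spreadsheet program.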
