Python
Elvis, 2018-05-29 18:19:29

Which libraries should I use to process large CSV files in Python?

I have several large CSV files, about 1.5-2 GB each, that need to be parsed and processed. What is the best tool for this? Which libraries?


3 answer(s)
Alexey M, 2018-05-29
@al_mo

Besides pandas, as Homon pointed out, you can also use numpy.
In some cases, working in numpy can be significantly faster than in pandas.
Also keep in mind that pandas DataFrames can be slow, so if you often need to run SQL-style queries, you can set up a local PostgreSQL instance and run those operations there.
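
For files of this size, one common way to use pandas without loading everything into memory is to read the CSV in chunks. Below is a minimal sketch of that idea, not something from the answer above: the function name `sum_by_key` and the column names `city` and `amount` are made up for illustration, and a small in-memory sample stands in for a real file path.

```python
import io

import pandas as pd


def sum_by_key(source, key, value, chunksize=100_000):
    """Sum `value` per `key`, reading the CSV in chunks.

    `source` can be a file path or a file-like object. Only one chunk
    of `chunksize` rows is held in memory at a time.
    """
    totals = None
    # usecols limits parsing to the two columns we actually need.
    for chunk in pd.read_csv(source, usecols=[key, value], chunksize=chunksize):
        part = chunk.groupby(key)[value].sum()
        # Merge this chunk's partial sums into the running totals.
        totals = part if totals is None else totals.add(part, fill_value=0)
    return totals


# Hypothetical sample data standing in for a multi-gigabyte file.
sample = io.StringIO("city,amount\nA,1\nB,2\nA,3\nB,4\nC,5\n")
result = sum_by_key(sample, "city", "amount", chunksize=2)
print(result)
```

With a real file you would pass its path instead of `sample` and pick a `chunksize` that fits your memory budget.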

Dimonchik, 2018-05-29
@dimonchik2013

pandas is the standard choice, including for things like pivot tables,
but in general I recommend looking into ClickHouse + Tabix.
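
To illustrate the pivot table part, here is a small sketch using `DataFrame.pivot_table`. The data and the column names (`region`, `product`, `sales`) are invented for the example; with a real CSV the frame would come from `pd.read_csv`.

```python
import pandas as pd

# Hypothetical data; in practice this would be loaded from the CSV file.
df = pd.DataFrame({
    "region": ["north", "north", "south", "south"],
    "product": ["a", "b", "a", "b"],
    "sales": [10, 20, 30, 40],
})

# One row per region, one column per product, summing sales in each cell.
table = df.pivot_table(index="region", columns="product",
                       values="sales", aggfunc="sum")
print(table)
```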
