You have a large CSV, you’re going to be reading it in to Pandas—but every time you load it, you have to wait for the CSV to load. And that slows down your development feedback loop, and might meaningfully slows down your production processing.
But it’s faster to read the data in faster. Let’s see how.
In this article we’ll cover:
- Pandas’ default CSV reading.
- The faster, more parallel CSV reader introduced in v1.4.
- A different approach that can make things even faster.
from Planet Python
via read more
No comments:
Post a Comment