News

When saving dataframes to disk, choosing an efficient file format can affect memory usage upon reloading. File formats like HDF5 or Parquet are designed for storing large datasets efficiently.
The convert2parquet function is used to convert large CSV files into smaller Parquet files. It offers several advantages such as reducing memory usage, improving processing speed, and optimizing data ...
Define a generator function to read lines from the CSV file one by one, avoiding loading the entire file into memory. Use data structures that minimize memory usage, such as generators, iterators ...
This Python package converts .csv files to .parquet files using the astropy and pandas libraries. The resulting .parquet files are compressed and can be read faster than uncompressed .csv files, and ...