Skip to content

This repository is Python benchmark for common usage in data science.

License

Notifications You must be signed in to change notification settings

cosmojg/python_performance_benchmark

 
 

Repository files navigation

Dataframe Disk Format Performance Benchmark

Forked from: https://github.com/lintseju/python_performance_benchmark

Pandas I/O

Data: n row x 4 columns random double dataframe.

Alt text

Remark:

  • time or size (Y axis) in log scale.
  • Maximum row for excel output is 1048575 rows, so file size and read speed of 10^7 are underestimated.

About

This repository is Python benchmark for common usage in data science.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.7%
  • Python 0.3%