Skip to content

Latest commit

 

History

History
70 lines (58 loc) · 2.02 KB

File metadata and controls

70 lines (58 loc) · 2.02 KB

Python

PyArrow - Apache Arrow Python bindings

This is the documentation of the Python API of Apache Arrow.

Apache Arrow is a universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics. It contains a set of technologies that enable data systems to efficiently store, process, and move data.

See the :doc:`parent documentation <../index>` for additional details on the Arrow Project itself, on the Arrow format and the other language bindings.

The Arrow Python bindings (also named "PyArrow") have first-class integration with NumPy, pandas, and built-in Python objects. They are based on the C++ implementation of Arrow.

Here we will detail the usage of the Python API for Arrow and the leaf libraries that add additional functionality such as reading Apache Parquet files into Arrow structures.

.. toctree::
   :maxdepth: 2

   install
   getstarted
   data
   compute
   memory
   ipc
   filesystems
   numpy
   pandas
   interchange_protocol
   dlpack
   timestamps
   orc
   csv
   feather
   json
   parquet
   dataset
   flight
   extending_types
   integration
   env_vars
   api
   getting_involved
   Python cookbook <https://arrow.apache.org/cookbook/py/>