Source: apache-arrow Section: devel Priority: optional Maintainer: Debian Science Maintainers Uploaders: Steffen Moeller , Sascha Steinbiss , Diane Trout , Dominique Belhachemi , Rene Engelhard Build-Depends: # Restrict to 64-bit little-endian architectures: upstream does not yet support others. architecture-is-64-bit, architecture-is-little-endian, # apache-arrow-testing , apache-parquet-testing , clang, clang-tidy, cmake, cython3, debhelper-compat (= 13), dh-python, dh-r, dh-sequence-numpy3, git, libboost-filesystem-dev, libbrotli-dev, libbz2-dev, libcurl4-openssl-dev, libgoogle-glog-dev, libgrpc++-dev, libgtest-dev, libjemalloc-dev, libmimalloc-dev, liblz4-dev, libre2-dev, libprotobuf-dev, libprotoc-dev, libs3-dev, libsnappy-dev, libssl-dev, libthrift-dev, libutf8proc-dev, libxsimd-dev, libzstd-dev, llvm, llvm-dev, ninja-build, pkgconf, protobuf-compiler-grpc, python3-thrift, python3-dev, python3-cffi , python3-dotenv , python3-git , python3-github , python3-hypothesis , python3-memory-profiler , python3-numpy-dev, python3-pandas , python3-pytest , python3-pytest-arraydiff , python3-pytest-cov , python3-responses , python3-semver , python3-setuptools-scm, python3-typeguard , python3-pytz , python3-wheel , r-base-dev, r-cran-assertthat, r-cran-bit64, r-cran-cpp11, r-cran-glue, r-cran-purrr, r-cran-r6, r-cran-rlang, r-cran-tidyselect, r-cran-vctrs, rapidjson-dev, tzdata, tzdata-legacy, zlib1g-dev Standards-Version: 4.7.3 Vcs-Browser: https://salsa.debian.org/science-team/arrow Vcs-Git: https://salsa.debian.org/science-team/arrow.git Homepage: https://arrow.apache.org/ # To be synced with dev/tasks/linux-packages/apache-arrow/debian/control.in Package: libarrow2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends} Description: Apache Arrow data processing library Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files. Package: libarrow-acero2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends}, libarrow2300 (= ${binary:Version}), Description: Apache library for processing large streams of data Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files for the Apache Acero module. Package: libarrow-dataset2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends}, libarrow2300 (= ${binary:Version}), libparquet2300 (= ${binary:Version}) Description: Apache Arrow is a data processing library for analysis Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files for Dataset module. Package: libarrow-flight2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends}, libarrow2300 (= ${binary:Version}) Description: Apache Arrow is a data processing library for analysis Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files for Flight RPC system. Package: libarrow-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow2300 (= ${binary:Version}), libbrotli-dev, libbz2-dev, libgrpc++-dev, liblz4-dev, libre2-dev, libsnappy-dev, libssl-dev, libutf8proc-dev, libzstd-dev, protobuf-compiler-grpc, libprotobuf-dev, libprotoc-dev, libthrift-dev, zlib1g-dev Description: Apache Arrow data processing library - header files Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files. Package: libarrow-acero-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-dev (= ${binary:Version}), libarrow-acero2300 (= ${binary:Version}), Description: Apache Acero is a tool for processing large streams of data Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files for the Acero module. Package: libarrow-dataset-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-dev (= ${binary:Version}), libarrow-dataset2300 (= ${binary:Version}), libparquet-dev (= ${binary:Version}) Description: Apache Arrow is a data processing library for analysis Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files for dataset module. Package: libarrow-flight-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-dev (= ${binary:Version}), libarrow-flight2300 (= ${binary:Version}) Description: Apache Arrow is a data processing library for analysis Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files for Flight RPC system. Package: libarrow-flight-sql2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends}, libarrow-flight2300 (= ${binary:Version}) Description: Apache Arrow Flight SQL library Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files for the Flight SQL extension. Package: libarrow-flight-sql-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-flight-dev (= ${binary:Version}), libarrow-flight-sql2300 (= ${binary:Version}) Description: Apache Arrow Flight SQL library - development files Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files for the Flight SQL extension. Package: libgandiva2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends} Description: Runtime expression compiler for Apache Arrow records Gandiva is a runtime expression compiler that uses LLVM to generate efficient native code for compute on Arrow record batches. Among the transformations of Arrow data files, Gandiva only handles projections and filters. . Gandiva was designed to take advantage of the Arrow memory format and modern hardware. From the Arrow memory model, since Arrow arrays have separate buffers for values and validity bitmaps, values and their null status can often be processed independently, allowing for better instruction pipelining. On modern hardware, compiling expressions using LLVM allows the execution to be optimized to the local runtime environment and hardware, including available SIMD instructions. To reduce optimization overhead, many Gandiva functions are pre-compiled into LLVM IR (intermediate representation). Package: libgandiva-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-dev (= ${binary:Version}), libgandiva2300 (= ${binary:Version}) Description: header files for Apache Arrow Gandiva library Gandiva is a runtime expression compiler that uses LLVM to generate efficient native code for compute on Arrow record batches. Among the transformations of Arrow data files, Gandiva only handles projections and filters. . This package provides C++ header files. Package: libparquet2300 Section: libs Architecture: any Multi-Arch: same Pre-Depends: ${misc:Pre-Depends} Depends: ${misc:Depends}, ${shlibs:Depends} Description: Apache Parquet is a columnar storage format Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ library files to process the Apache Parquet format. Package: libparquet-dev Section: libdevel Architecture: any Multi-Arch: same Depends: ${misc:Depends}, libarrow-dev (= ${binary:Version}), libparquet2300 (= ${binary:Version}) Description: Apache Parquet is a columnar storage format Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides C++ header files. Package: python3-pyarrow Section: python Architecture: any Multi-Arch: no Breaks: libarrow-python2300 (<< 23.0.1-2), libarrow-python-flight2300 (<< 23.0.1-2) Replaces: libarrow-python2300 (<< 23.0.1-2), libarrow-python-flight2300 (<< 23.0.1-2) Depends: ${misc:Depends}, ${shlibs:Depends}, ${python3:Depends}, libarrow2300 (= ${binary:Version}) Description: Python interface to the Arrow C++ libraries Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . This library provides a Python API for functionality provided by the Arrow C++ libraries, along with tools for Arrow integration and interoperability with pandas, NumPy, and other software in the Python ecosystem. Package: pyarrow-dev Section: python Architecture: any Multi-Arch: no Breaks: python3-pyarrow (<< 23.0.1-2) Replaces: python3-pyarrow (<< 23.0.1-2) Depends: ${misc:Depends}, ${shlibs:Depends}, python3, libarrow-dev (= ${binary:Version}), python3-pyarrow (= ${binary:Version}) Description: Python interface to the Apache Arrow C++ libraries Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This provides C headers for interacting with the pyarrow interface to the Arrow C++ libraries. Package: r-cran-arrow Section: gnu-r Architecture: any Multi-Arch: no Depends: ${misc:Depends}, ${shlibs:Depends}, ${R:Depends}, r-api-4.0, r-cran-assertthat, r-cran-bit64, r-cran-cpp11, r-cran-glue, r-cran-purrr, r-cran-r6, r-cran-rlang, r-cran-tidyselect, r-cran-vctrs Description: R package for Apache Arrow data storage management Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . Arrow's libraries implement the format and provide building blocks for a range of use cases, including high performance analytics. Many popular projects use Arrow to ship columnar data efficiently or as the basis for analytic engines. . This package provides the R interface to arrow. It is the same as published independently on CRAN. Package: arrow-tools Section: utils Architecture: any Multi-Arch: foreign Depends: ${misc:Depends}, ${shlibs:Depends}, libarrow2300 (= ${binary:Version}) Description: Apache Arrow IPC format conversion utilities Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . This package provides command-line utilities for converting between Arrow IPC file and stream formats: arrow-file-to-stream and arrow-stream-to-file. Package: parquet-tools Section: utils Architecture: any Multi-Arch: foreign Depends: ${misc:Depends}, ${shlibs:Depends}, libparquet2300 (= ${binary:Version}) Description: Apache Parquet file inspection utilities Apache Arrow defines a language-independent columnar memory format for flat and nested data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead. . This package provides command-line utilities for inspecting Parquet files: parquet-reader, parquet-dump-schema, parquet-dump-footer, parquet-dump-arrow-statistics, parquet-scan.