Pyarrow apache-arrow-20.0.0 #7750

@Jenkins-J

Package Name

Pyarrow

Package Version

apache-arrow-20.0.0

GitHub Repo url

https://github.com/apache/arrow

Desired packaging

build script only

After installing this package, importing pyarrow fails with an ImportError for a missing shared library:

root@pod libabsl$ pip3 install -q --index-url https://${artifactory_user}:${artifactory_token}@na.artifactory.swg-devops.com/artifactory/api/pypi/hyc-d-skywalker-team-lakehouse-on-prem-pypi-local/simple pyarrow==20.0.0
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
root@pod libabsl$ python3 -c "import pyarrow.lib as _lib"
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/usr/local/lib64/python3.9/site-packages/pyarrow/__init__.py", line 61, in <module>
    import pyarrow.lib as _lib
ImportError: libabsl_leak_check.so.2401.0.0: cannot open shared object file: No such file or directory
root@pod libabsl$

Several other shared object files are also reported as missing, for example in ArrowEvalPythonExecSuite runs:

=======================================
ArrowEvalPythonExecSuite:
Traceback (most recent call last):
  File "/opt/shims/spark35/spark_home/python/pyspark/sql/pandas/utils.py", line 53, in require_minimum_pyarrow_version
    import pyarrow
  File "/usr/local/lib64/python3.9/site-packages/pyarrow/__init__.py", line 61, in <module>
    import pyarrow.lib as _lib
ImportError: libutf8proc.so.2: cannot open shared object file: No such file or directory

==========================================
ArrowEvalPythonExecSuite:
Traceback (most recent call last):
  File "/opt/shims/spark35/spark_home/python/pyspark/sql/pandas/utils.py", line 53, in require_minimum_pyarrow_version
    import pyarrow
  File "/usr/local/lib64/python3.9/site-packages/pyarrow/__init__.py", line 61, in <module>
    import pyarrow.lib as _lib
ImportError: liborc.so: cannot open shared object file: No such file or directory

==========================================
ArrowEvalPythonExecSuite:
Traceback (most recent call last):
  File "/opt/shims/spark35/spark_home/python/pyspark/sql/pandas/utils.py", line 53, in require_minimum_pyarrow_version
    import pyarrow
  File "/usr/local/lib64/python3.9/site-packages/pyarrow/__init__.py", line 61, in <module>
    import pyarrow.lib as _lib
ImportError: libprotobuf.so.25.3.0: cannot open shared object file: No such file or directory
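
Each import attempt surfaces only the first library the dynamic loader cannot resolve, so the list above may be incomplete. A minimal sketch for enumerating every unresolved dependency in one pass is shown below. It assumes `ldd` is available in the pod and that the package lives at the path shown in the tracebacks. The function names `missing_libs` and `scan_package` are made up for illustration and are not part of pyarrow.

```python
import re
import subprocess
from pathlib import Path

# Matches ldd lines such as "libfoo.so.1 => not found".
_NOT_FOUND = re.compile(r"^\s*(\S+)\s+=>\s+not found\s*$")


def missing_libs(ldd_output: str) -> list:
    """Return library names that ldd reports as 'not found', deduplicated in order."""
    seen = []
    for line in ldd_output.splitlines():
        match = _NOT_FOUND.match(line)
        if match and match.group(1) not in seen:
            seen.append(match.group(1))
    return seen


def scan_package(pkg_dir: str) -> dict:
    """Run ldd on each shared object under pkg_dir and collect unresolved dependencies.

    Assumption: ldd is on PATH. Files that are not dynamic objects simply
    produce no 'not found' lines and are skipped.
    """
    report = {}
    for so_file in sorted(Path(pkg_dir).rglob("*.so*")):
        if not so_file.is_file():
            continue
        proc = subprocess.run(["ldd", str(so_file)], capture_output=True, text=True)
        missing = missing_libs(proc.stdout)
        if missing:
            report[str(so_file)] = missing
    return report
```

Calling `scan_package("/usr/local/lib64/python3.9/site-packages/pyarrow")` in the affected pod should return a mapping from each extension module to the libraries it cannot resolve, which may help decide whether those libraries need to be bundled into the wheel or installed separately.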

Has this issue been encountered before?
Does using the package this way require it to be built or rebuilt with a different configuration?
