Skip to content

bug: graphframes on Databricks Free Edition throws No handler found for extension type #836

@olbapjose

Description

@olbapjose

Describe the bug

I have installed graphframes-py in Databricks Free Edition with environment version 5 by !pip install graphframes-py. I can import it and create a graph, but algorithms like pageRank or connectedComponents throw No handler found for extension type: type.googleapis.com/org.graphframes.connect.proto.GraphFrames.

  • The workaround suggested in another thread of setting spark.databricks.service.server.enabled to false is not possible because Free Edition does not allow spark.conf.set.
  • Downloading the JAR file manually is possible but I cannot make Spark read it, as it is not possible to either write on current classpath folders (I have tried but they are all protected) nor modify the CLASSPATH variable - it won't take the new value.

To Reproduce

Steps to reproduce the behavior:

  1. Create a notebook in Databricks Free Edition
  2. !pip install graphframes-py
  3. Create a graph, and try g.pageRank(...) or g.connectedComponents.
  4. However, g.degrees.show() or g.inDegrees.show() or g.outDegrees.show() are working normally

Expected behavior

The algorithms should run normally.

System [please complete the following information]:

  • OS: e.g. [Ubuntu 18.04]
  • Python Version (if applied): Python 3.12.3 that comes with Databricks Free environment 5
  • Spark / PySpark version: 4.1.0
  • GraphFrames version: graphframes-py==0.9.2 and graphframes-py==0.11.0

Component

  • Scala Core Internal
  • Scala API
  • Spark Connect Plugin
  • PySpark Classic
  • PySpark Connect

Additional context

Are you planning on creating a PR?

  • I'm willing to make a pull-request

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions