neo4j-spark-connector

Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs

APACHE-2.0 License

Stars
312

Bot releases are visible (Hide)

neo4j-spark-connector - Neo4j Connector for Apache Spark

Published by conker84 almost 4 years ago

  • First major release of the Neo4j Connector for Apache Spark
  • Based around the DataSource API, with polyglot support
  • Supports (2.4.5 <= Spark < 3)
  • Supports Neo4j 3.5, 4.0, and 4.1
neo4j-spark-connector - Second pre release of the Neo4j Connector for Apache Spark

Published by conker84 about 4 years ago

Neo4j Streams Kafka Connect (8)

Hello community!
We're happy to share the second pre-release of the Neo4j Connector for Apache Spark!

The connector supports Apache Spark starting from version 2.4.1.

You can find the documentation at the following link.

If you want to provide us feedback, please raise an issue.

In attachment you will find two files:

  • neo4j-spark-connector-2.11-4.0.0-pre2.jar which is for Scala 2.11
  • neo4j-spark-connector-2.12-4.0.0-pre2.jar which is for Scala 2.12
neo4j-spark-connector - First pre release of the Neo4j Connector for Apache Spark

Published by conker84 about 4 years ago

Hello community!
We're happy to share the first pre-release of the Neo4j Connector for Apache Spark.

The connector supports Apache Spark starting from verson 2.4.1.

You can find the documentation at the following link.

If you want to provide us feedback, please raise an issue.

In attachment you will find two files:

  • neo4j-spark-connector-2.11-4.0.0-pre1.jar which is for Scala 2.11
  • neo4j-spark-connector-2.12-4.0.0-pre1.jar which is for Scala 2.12
neo4j-spark-connector - Release 2.4.5-M2 for Scala 12

Published by jexp about 4 years ago

Thanks a lot to @conker84 @nielsdejong @utnaf for your contributions.

Contains the following fixes and improvements.

  • Updated saveGraph.updateNodes to use numPartitions instead of batch size.
  • Added correct numPartitions calculation for edges
  • fixes #133: Neo4jDataFrame#mergeEdgeList should skip node creation (#137)
  • fixes #138: Support for Scala 2.12.x (#146)
  • fixes #132: Supporting temporal data types using Neo4jDataFrame.mergeEdgeList (#136)
  • Updated saveGraph.updateNodes to use numPartitions instead of batch size.
neo4j-spark-connector - Release 2.4.5-M1 for Neo4j 4.0, Spark 2.4.5

Published by jexp over 4 years ago

This release makes the spark connector ready for Neo4j 4.0, we also upgraded the spark depedency.

Thanks to @conker84 @moxious and @JakobWyatt-VGW for your work and contributions.

  • spark 2.4.5
  • neo4j 4.0 and multi-db support
  • retry
  • separate node from relationship creation
  • safe closing of driver and session
  • new, shorter config parameters
  • improved datetime handling

The release is also available on https://spark-packages.org/package/neo4j-contrib/neo4j-spark-connector

neo4j-spark-connector - Release 2.4.1-M1

Published by neo4j-oss-build over 4 years ago

Release with some bugfixes and additions.

Bigger release will be coming in a few weeks.

  • Spark 2.4.0

  • Neo4j 3.5.x

  • Driver 1.7.5

  • encryption flag config

  • handling of datetime values

  • better recursive collection handling

  • handling of sequences

  • set complex properties on vertices

  • documentation for mergeEdgeList with relationship properties

  • Neo4jDataFrame#createNodes method and renameMap to mergeEdgeList

thanks to @JakobWyatt-VGW @echohlne @rbramley @tomerlevi1983 @conker84 for your contributions!

neo4j-spark-connector - Release 2.2.1-M5

Published by jexp over 6 years ago

This release updates the Neo4j Bolt driver to 1.6.1 and allows for separate read and write transactions in the implementation so that bolt+routing URIs and reading from Read-Replicas should work.

Also updates the spark-dependency to 2.2.1

Package Rankings
Top 23.02% on Repo1.maven.org