sparklyr

R interface for Apache Spark

APACHE-2.0 License

Stars
929
Committers
96


sparklyr - CRAN v0.6.2

Published by javierluraschi about 7 years ago

CRAN 0.6.2 - Patch to address the following issues:

sparklyr - CRAN v0.6.1

Published by javierluraschi about 7 years ago

CRAN 0.6.1 - Patch to address the following issues:

sparklyr - CRAN v0.6

Published by javierluraschi about 7 years ago

CRAN 0.6 - Major release with support to:

  • Distribute R computations using spark_apply() to execute arbitrary R code across your cluster over rows and groups.
  • Connect to External Data using spark_read_source(), spark_write_source(), spark_read_jdbc() and spark_write_jdbc().
  • Use the Latest Frameworks including dplyr 0.7, DBI 0.7, RStudio 1.1 and Spark 2.2.
  • Spark Connections add a new Databricks connection that enables using sparklyr in Databricks through mode="databricks" and add support for Yarn Cluster through master="yarn-cluster"; connection speed was also improved.
  • Dataframes add support for sdf_pivot(), sdf_broadcast(), cbind(), rbind(), sdf_separate_column(), sdf_bind_cols(), sdf_bind_rows(), sdf_repartition() and sdf_num_partitions().
  • Machine Learning adds support for multinomial regression in ml_logistic_regression(), weights.column for GLM, ml_model_data() and a new ft_count_vectorizer() function for ml_lda().
  • Many other features: initial support for broom over ml_linear_regression() and ml_generalized_linear_regression(); dplyr support for %like%, %rlike% and %regexp%; extensions now support download_scalac() to help you install the required Scala compilers while developing extensions; database management is simplified with tbl_change_db() and src_databases() to query and switch between Hive databases; and a joint effort with Microsoft has started to improve Spark installations under github.com/rstudio/spark-install.
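
As an illustration of the spark_apply() item above, here is a minimal sketch of applying an R function to each group of a Spark DataFrame. The helper names mean_petal_width and run_on_spark are hypothetical and not part of sparklyr; the Spark portion assumes sparklyr, dplyr and a local Spark installation are available, and is wrapped in a function so nothing connects to Spark until it is called.

```r
# Per-group function: receives an ordinary R data frame and returns one.
# `mean_petal_width` is an illustrative helper, not a sparklyr function.
mean_petal_width <- function(df) {
  data.frame(n = nrow(df), mean_width = mean(df$Petal_Width))
}

# Local check on a plain data frame. The column name follows sparklyr's
# convention of replacing dots with underscores when iris is copied to Spark.
local_df <- data.frame(Petal_Width = c(0.25, 0.5, 0.75))
local_result <- mean_petal_width(local_df)

# Hypothetical wrapper: assumes a working local Spark installation.
run_on_spark <- function() {
  sc <- sparklyr::spark_connect(master = "local")
  on.exit(sparklyr::spark_disconnect(sc))
  iris_tbl <- sparklyr::sdf_copy_to(sc, iris, name = "iris", overwrite = TRUE)
  # Run the function once per Species group on the cluster.
  result <- sparklyr::spark_apply(iris_tbl, mean_petal_width,
                                  group_by = "Species")
  dplyr::collect(result)
}
```

Testing the function on a local data frame first is a cheap way to catch errors before they surface inside Spark workers, where they are harder to read.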

Additional changes and improvements can be found in the sparklyr NEWS file and the sparklyr 0.6 release blog post.

sparklyr - CRAN v0.5.6

Published by javierluraschi over 7 years ago

CRAN 0.5.6 - Patch to address the following issues:

sparklyr - CRAN v0.5.5

Published by javierluraschi over 7 years ago

CRAN 0.5.5 - Patch release over 0.5.4 with compatibility fixes for RStudio 1.1 and mode="databricks" connections.

sparklyr - CRAN v0.5.4

Published by javierluraschi over 7 years ago

CRAN 0.5.4 - Patch release over 0.5.3 with compatibility fixes for dplyr 0.6 and Spark 2.1.

sparklyr - CRAN v0.5.3

Published by javierluraschi over 7 years ago

CRAN 0.5.3 - Patch release over 0.5.2 with fixes for the new DBI::sqlInterpolate (#351) and mutate under dplyr 0.6 (#523).

sparklyr - CRAN 0.5.2

Published by javierluraschi over 7 years ago

CRAN 0.5.2 - Patch release over 0.5.1 with fixes for connections under Windows that require path normalization (https://github.com/rstudio/sparklyr/pull/492) and for Livy connections to handle states properly while connecting (https://github.com/rstudio/sparklyr/pull/421).

sparklyr - CRAN 0.5.1

Published by javierluraschi almost 8 years ago

Additional changes and improvements can be found in the sparklyr NEWS file and the sparklyr 0.5 release blog post.

sparklyr - CRAN 0.4

Published by javierluraschi about 8 years ago

Additional changes and improvements can be found in the sparklyr NEWS file and the sparklyr 0.4 release blog post.