Read more about the article Apache Spark RDD reduceByKey transformation
Apache Spark RDD reduceByKey transformation

Apache Spark RDD reduceByKey transformation

reduceByKey(func) converts a dataset of (K, V) pairs, into a dataset of (K, V) pairs where the values for each key are aggregated using the given reduce function.

Read more about the article Apache Spark RDD groupBy transformation
Apache Spark RDD groupBy transformation

Apache Spark RDD groupBy transformation

As per Apache Spark documentation, groupBy returns an RDD of grouped items where each group consists of a key and a sequence of elements.

Read more about the article Apache Spark RDD filter transformation
Apache Spark RDD’s filter transformation

Apache Spark RDD filter transformation

As per Apache Spark, filter(function) returns a new dataset formed by selecting those elements of the source on which function returns true.

Read more about the article Working with AVRO data in Apache Spark
Working with AVRO data

Working with AVRO data in Apache Spark

In this video we will understand how to work with AVRO data in Apache Spark.For the demo we are using Spark 2.4 version and scala language.

Read more about the article Installing Apache Spark on Windows
Installing Apache Spark on Windows

Installing Apache Spark on Windows

In this post I will tell you how to install Apache Spark on windows machine. By the end of this tutorial you’ll be able to use Spark with Scala on windows.