Orbis Connect presents Code & Create, a series of webinars and learning sessions led by industry experts.
In the second installment of these special Code & Create workshops, we look at Apache Spark with guest speaker Neeraj Bhadani, Data Scientist at Expedia Group.
Neeraj has more than a decade of experience building software and currently works in the AI & Data Science team at Expedia Group. Prior to Expedia Group, he worked on various big data projects, dealt directly with clients as a technical specialist, and migrated various ETL pipelines to Apache Spark. He also received a Gold Medal for securing first place in his batch during his undergraduate days.
During this workshop, we discuss the fundamentals of Apache Spark and how to use Spark DataFrames for data analysis.
Apache Spark is a general-purpose computing engine with in-memory computing capabilities. It can be used for a variety of workloads, such as batch processing, iterative problems, and stream processing. It is designed to be highly scalable and provides APIs in Scala, Python, R, Java, and SQL. It also integrates easily with other big data tools.
This is the second installment of a two-part series on Apache Spark. Missed the Foundation to Apache Spark session? No worries, check it out here!