How to develop Spark applications with Zeppelin notebooks

Updated: May 25

I love working with Zeppelin notebooks. It's so simple, and you can just try something out.

Working with DataFrames and Spark SQL in particular is a blast.

What is Zeppelin?


Zeppelin is a notebook tool, just like Jupyter. You can run it on a single server or on your Hadoop cluster, and it runs your Spark jobs in the background.

Advantages of Zeppelin


The nice thing about it is that everything lives in your notebook. You write your code in the notebook, run it more or less directly there, and get visualizations that you don't have in a normal Spark environment.

So you have your notebook, you write your code, then you can run SQL queries and visualize the results directly - as tables, bar charts, line graphs and so on. The possibilities are really amazing. You can also easily download the data behind the plots as CSV files.
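To make the "visualize the results directly" part a bit more concrete, here is a minimal sketch of how output can be handed to Zeppelin's display system from a Python paragraph. It assumes the paragraph runs under a Python interpreter such as %python or %pyspark. The helper name to_zeppelin_table and the sample rows are made up for illustration. The idea is that printed output starting with %table, with tab-separated columns and one row per line, is rendered as a table that you can switch to a bar or line chart.

```python
# Build the text that Zeppelin's display system renders as a table or chart.
# Assumption: this runs inside a Zeppelin Python paragraph (%python or %pyspark).
# The helper name and the sample rows are hypothetical, for illustration only.

def to_zeppelin_table(header, rows):
    """Return a '%table' payload: tab-separated columns, one row per line."""
    lines = ["\t".join(str(col) for col in header)]
    for row in rows:
        lines.append("\t".join(str(value) for value in row))
    return "%table " + "\n".join(lines)

header = ["country", "orders"]
rows = [("DE", 120), ("US", 340), ("FR", 95)]

payload = to_zeppelin_table(header, rows)

# Printing the payload is what hands it to the display system in a notebook.
print(payload)
```

For a Spark DataFrame you usually don't need to build this string yourself: a %sql paragraph renders its result this way automatically, and in a %pyspark paragraph z.show(df) should do the same.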

Need for Zeppelin


As you can see, the whole thing really has enormous advantages. I'm definitely convinced that you need Zeppelin. You have to know what's happening in your Spark jobs, and you have to learn how to work with a tool like this. That's absolutely important.

But in my opinion it's fun - tremendous fun!

What do you think about Zeppelin notebooks? Do you think they are as important as I do? Let me know in the comments!




>> created by Mira Roth








Check out my full video on YouTube!

© 2020 Team Data Science - Andreas Kretz