I love working with Zeppelin notebooks. Its so simple and you can just try something out.
Especially working with dataframes and SparkSQL is a blast.
What is a Zeppelin?
A Zeppelin is a tool, a notebook tool, just like Jupiter. You can run it on a server and you can run it on your Hadoop cluster or whatever. And it can run Spark jobs in the background.
Advantages of Zeppelin
The nice thing about it is that you have your notebook. You can create in the notebook, then run it more or less directly here and you can have a visualization that you cannot have in a normal spark environment.
So you have your notebook, you write your code, then you can make sequel queries and visualize the stuff directly - as tables, bar charts, line graphs and so on. The possibilities are really amazing. You can also easily download the plots as CSV files.
Need for Zeppelin
As you can see, the whole thing really has enormous advantages. I'm definitely convinced that you need this Zeppelin stuff. You have to know what's happening, what's going on. You have to learn this and you need to know this. That's absolutely important.
But in my opinion it's fun - tremendous fun!
What do you think about Zeppelin notebooks? Do you think it is as important as I do? Let me know in the comments!
>> created by Mira Roth
My free 100+ pages Data Engineering Cookbook
Follow us on LinkedIn
Check out my YouTube
Check out my full video on YouTube!
Comments