Alert: The Sparklet Installer from Mund Consulting is no longer available on their official site, so we recommend this installation process instead of the single installer for Spark and Zeppelin.
Are you into the Big Data syndrome yet? Big Data analytics and reporting is the next big thing in the tech world these days. Apache Spark is fast becoming the most favored platform for crunching large-scale data. As the makers of Apache Spark put it:
Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Write applications quickly in Java, Scala, Python, R.
Combine SQL, streaming, and complex analytics.
Yeah, I know about Spark cluster computing – what's the big deal?
Along with the Spark revolution, the industry is witnessing a strong push towards notebooks (interactive data analysis and interpretation tools). You might have heard names like IPython Notebook in this segment. But a new entrant to this league, Apache Zeppelin, is taking the industry by storm.
With support for multiple programming languages, SQL queries, and graph tools like GraphX, Zeppelin is a must-have if you are getting into Big Data and Spark at this moment.
Now, as a data scientist, you may not want to get into the intricacies of installing two different pieces of software and worrying about version compatibility, installation how-tos, and more. Interestingly, there exists an easy-to-use two-click installer that sets up Spark and Zeppelin together on your Windows machine. We recommend this to candidates attending our Big Data and Hadoop Courses to save time and avoid confusion.
If you are an Ubuntu, Linux, or Mac fan, well, you might have to wait for the equivalent versions for those platforms to hit the stands.