This guide covers what you need to learn Big Data. Fort this guide, I use Windows 10 running with WSL 2 using Ubuntu 20.04. This article includes the following software and tools:
- Hadoop 2.10.1
- Apache Scoop
- Apache Flume
- Apache Kafka
- Apache Spark
- My SQL 8
- Apache Pig
- Apache Hive
- Apache HBase
- Apache NoSQL
- Apache MLLib
- Apache GraphX
- Scala REPL
Congratulations! You have successfully installed all you need to learn Big Data in your Ubuntu subsystem of Windows 10. It’s relatively easier as we don’t need to download or compile/build native Hadoop libraries.
BTW, WSL is not a virtual machine however it provides you almost the same experience as you would have in a native Linux system.
Happy coding!