Cloud Computing Assignment 2
Cloud Computing Assignment 2
Name VANISHREE B
2. Install Java:
- Hadoop requires Java to run. Install a compatible version of
Java on each machine in the cluster.
3. Set Up SSH:
- Configure SSH (Secure Shell) to enable passwordless
communication between nodes in the cluster. Generate SSH
keys and distribute them to all machines to allow seamless
communication.
5. Configure Hadoop:
- Modify the Hadoop configuration files to suit your cluster
setup.
- The key configuration files are core-site.xml, hdfs-site.xml,
and mapred-site.xml. These files specify parameters such as
cluster node addresses, replication factor, and other settings.
- Update these configuration files on all machines consistently.
6. Set Up Hadoop Environment Variables:
- Configure the Hadoop environment variables on each
machine to specify the Java installation path and Hadoop home
directory.
- Add these variables to the respective user profiles or set them
in the hadoop-env.sh file.
11. Access the Instance: After launching the instance, you can
connect to it using remote desktop protocol (RDP) for Windows
instances. AWS provides a public DNS name or IP address that
you can use to connect to your Windows instance.