Part 02 AcessingHadoopAtTACC
Part 02 AcessingHadoopAtTACC
5
However, for this course,
6
Multi-Factor Authentication
• Multi-Factor Authentication with Duo
https://portal.xsede.org/mfa
Check Hadoop Reservation
• log on to Wrangler login node from your SSH client
>ssh [email protected]
8
Access Hadoop Reservation
Once the reservation status is “active”, a user can access
through slurm job:
9
Access Hadoop Cluster with VNC
Please visit: vis.tacc.utexas.edu
Enter creden;al
10
1. Choose Wrangler Tab
TRAINING-OPEN
11
An VNC Session Enable
Access to WebUI
• There several Web UIs run on different port
namenode
• Cluster information port 50070
• E.g. c252-101:50070
• Job information port 8088
• E.g c252-101:8088
12
13
14
15
16
Access Hadoop Reservation via idev
Session
User can submit idev session to hadoop cluster
reservation
Ø idev –r hadoop+TRAINING-OPEN+2375
Please limit your usage to Hadoop related tasks, you can also
submit idev without using reservation for non-hadoop tasks.
17
Slurm
Slurm is an open source, fault-tolerant, and highly scalable cluster
management and job scheduling system for large and small Linux
clusters.
• sbatch is used to submit a job script for later execution.
• sbatch myHadoopJob.slurm
• scancel is used to cancel a pending or running job or job step.
• scancel 1234
• scontrol is the administrative tool used to view and/or modify
Slurm state.
• scontrol show reservation
• sinfo reports the state of partitions and nodes managed by Slurm.
• squeue reports the state of jobs or job steps.
• squeue -u $USER
Batch Job Script
https://portal.tacc.utexas.edu/user-guides/
wrangler#hadoop-hdfs-jobs-on-wrangler
myHadoopJob.slurm
21
On project page choose: Manage -> Create Hadoop Reserva;on
the number of
nodes (1 ~10) to be Schedule
used for the Start ;me
Hadoop cluster.
Dura;on
(1-30 Days)
22