0% found this document useful (0 votes)

4 views17 pages

User Manual

HPC manual

Uploaded by

EZZALDEN AYMAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views17 pages

User Manual

HPC manual

Uploaded by

EZZALDEN AYMAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

High Performance Computing Facility

Indian Institute of Technology Goa

HPC user manual

Last updated: July 2024

IIT Goa- HPC user manual Page 1 of 17

Please read this manual carefully before using the HPC facility.

1. IIT Goa faculty/ researchers/ students are authorized to access this

facility.
2. These instructions are only for a user with some experience. You need
proficiency in Linux and parallel programming.
3. For new user registrations visit https://hpc.iitgoa.ac.in/ and fill the
“New user registration” form. You will receive an email upon
completion of the user creation process.
4. User usage policy:
a) Regular user (UG/ PG students) will get 100 GB storage limit.
b) Ph.D. / Research users will get 1 TB storage limit.
c) Faculty will get 20 TB storage limit.

IIT Goa- HPC user manual Page 2 of 17

1. About the cluster IIT Goa HPC

This cluster having 16 CPU nodes & 1 GPU node and 1 DGX
GPU node.
Configuration:
CPU Nodes:
 2x Intel Xeon-Gold 6248 Processor, 20 Core, total 40 cores,
 192 GB Memory.
 Mellanox 100Gbps Interconnect
 Total 16 nodes and 640 Cores for running MPI, OpenMP and
 Hybrid jobs.
 The Node names are node 1 to node 16.

GPU Node:
 2x Intel Xeon-Gold 6248 Processor, 20 Core, total 40 cores,
 192 GB Memory
 Mellanox 100Gbps Interconnect
 GPU Node having Nvidia Tesla V100 GPU Card and installed
 with necessary drivers and configured to work with slurm job
 scheduler.
 Total 40 Cores for running GPU jobs
 The node are named as gpu1.

Storage:
 This cluster also has 200 TB Lustre Storage in this storage is
allotted to "/home".

IIT Goa- HPC user manual Page 3 of 17

1 a. How to Access HPC

To Access or login to the HPC can be done using SSH (Secure Shell). You
can use software like Putty or directly access SSH from Terminal /
command prompt.

HPC Host name: hpc.iitgoa.ac.in

1. Connecting from Terminal/ Command line.

Example:

ssh [email protected]

2. Connect using Putty.

1. Connect to the hpc.iitgoa.ac.in 2. Enter your username and then password

IIT Goa- HPC user manual Page 4 of 17

2.Slurm Job Scheduler - CLI

a. Submitting Jobs
To submit job, user need to create a job script as follows
For GPU Nodes:

#!/bin/bash
#SBATCH --job-name=newjob
#SBATCH --partition=gpu
#SBATCH [email protected]
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=2
#SBATCH --gres=gpu:1
#SBATCH --mail-type=ALL
#SBATCH --workdir=/home/iitgoa/
#SBATCH --output=newjob%j.out

cd $SLURM SUBMIT DIR

echo $SLURM JOB NODELIST > hostfile $SLURM
JOBID
For CPU Nodes:
To submit job user need to create a job script as follows
#!/bin/bash
#SBATCH -N 1
#SBATCH --ntasks-per-node=32
#SBATCH -J <testrun>
#SBATCH -p route
#SBATCH --time=24:00:00
#SBATCH -o slurmANAj.out
#STDOUT
#SBATCH -e slurmANAj.err
#STDERR
#SBATCH --export=all

IIT Goa- HPC user manual Page 5 of 17

#[email protected]
#SBATCH --mail-type=ALL
cd $SLURM SUBMIT DIR
echo $SLURM JOB NODELIST > hostfile $SLURM
JOBID

## OpenMP Case
## Make sure the ppn value and OMP NUM THREADS
value are same
or
##leave with SLURM NTASKS env variable
#export OMP NUM THREADS=$SLURM NTASKS
#<your executable> >& out $SLURM JOBID

#MPI Case
### Only one executable allowed
mpirun -np <npvalue>
<your/executable/with/path> >& out $SLURM JOBID

In this above file change what are in <...> to its

appropriate value and save as for example myscript.sh

Then submit the myscript.sh as follows :

#sbatch myscript.sh

This command submits jobs to scheduler and returns a job id. This
jobid
will be used later for monitoring and managing jobs.

IIT Goa- HPC user manual Page 6 of 17

b. Monitor the schedule queue

#squeue

Above command will out put the information about the

job is scheduler queue with its states and other details.
sq ueue

10131D PARTITION NAME USER ST TIME NODES NODE LIST

(REASON)

In the above output ST means State of the Job in

queue R means Running, P means Pending, C means
Completing and etc.

c. Cancel a job

scancel <jobid>

This command will cancel or delete the job

scancel -u <username>

This command will cancel all the job of user <username>

scancel -t PENDING -u <username>
This command will cancel all Pending jobs of user <username>

d. Other scheduler commands

scontrol show jobid <jobid>

This command will show the full details of job which is in

queue.
scontrol show jobid -dd <jobid>

This command will show the full details of job including its
job script file which is in queue.

IIT Goa- HPC user manual Page 7 of 17

sacct --
format=JoblD,JobName,MaxRSS,UserCPU,SystemCPU,CpuTime -j
<jobid>

This command will show the details of the job which

is completed and a day old.

For example:
sacct --
format=JoblD,JobName,MaxRSS,UserCPU,SystemCPU,CpuTime -j
2588
JobID JobName MaxRSS UserCPU SystemCPU
CPUTime

2588 MA00_5LSt+ 09:30:13 02:15.525

09:40:48
2588. batch batch 4588K 00:00.156 00:00.093
09:40:48
2588.0 pmi_proxy 2020K 09:30:13 02:15.431
00:18:09

scontrol hold <jobid>

This command hold the job which is queue but not running.
scontrol resume <jobid>

This command will release the hold job.

sstat --format=AvePages,AveRSS,AveVMSize,Jobl D
-j <jobid>
This command will show the statistics of a running job.

e. Slurm Partition details

This cluster is partitioned with respect to scheduler to use the

resource fairly. The following is the details of the slurm partition
information.

IIT Goa- HPC user manual Page 8 of 17

S.No Partition No of Nodes No of Cores Wall
1 cpu 16 640 INFINITE
2 gpu 1 40 INFINITE
3 All 17 680 INFINITE

3. Samooh CMS -Job submission portal

Samooh CMS version 1.6 is web based cluster

management suite which provides the job submission
and managing portal.

IIT Goa- HPC user manual Page 9 of 17

a. Dashbord

Users Dashboard will have Node Usage, Scheduler Partition usage

details. Node Usage details include Node name, Total CPU,
Allocated CPU, Free CPU, Free MEM and Current Load of each
node. Scheduler Partition usage details includes Job ID, Partition
Name, Username, Job Name, Job State, Time, Time Limit, CPUs,
Nodes, Node list, Actions. Actions include Information of job,
delete job, hold job and release holed job.

b. Job scheduler

Using this menu user can monitor the all their scheduled jobs
details like Job ID, Partition Name, User name, Job Name, Job
State, Time, Time Limit, CPUs, Nodes, Node list. Users also can do
the actions include ® get information of job, "X" delete job, hold
job and release old job.

Using this menu user can submit new job to users. This
will give window to provide necessary details like job
name, communication email, stdout file, working

IIT Goa- HPC user manual Page 10 of 17

directory, mail type, number of nodes, and number of
cores, wall time and very importantly job execution
commands.

This add new job window has info button at each input,
which will give required information about the input
field.

c. Scheduler -> Job Scheduler -> Add Job

IIT Goa- HPC user manual Page 11 of 17

d. Job Template

Job Template is used for saving frequently submitted jobs

details to the template show that it can be later used. Due to
that its saving lot of re typing of job commands using this
menu user can job template for various type of jobs. In this
job template user can set important settings of job like job
name, communication email, stdout file, working directory,
mail type, number of nodes, and number of cores, wall time
and very importantly job execution commands.

IIT Goa- HPC user manual Page 12 of 17

e. Job Statistics
Using this menu user can see their cluster usage statistics as
graph from given date interval. This include Date wise number
of core usage, Raw and Actual Usage, Overall Usage, and
Queue/Partition wise usage.

4.Installing Packages
The master node is configured in such a way that all software
installed in /home will be available to all compute nodes. So
we recommend to choose the installation path as follows.

/home/<usename>/softwares/<softwarename>/<version>

For Example: FFTW 3.3.7 installed in

/home/hpcuser/softwares/fftw/3.3.7

IIT Goa- HPC user manual Page 13 of 17

With this method we can have any number of version of
same package.
So if the package need ./configure command to install then
add

--prefix=
/home/<username>/softwares/<softwarename>/<version>

If the packed need cmake to install add -DINSTALL_PREFIX as

follows

-DINSTA LL PREFIX
homekusemainc>/softwarcsi<softwarenamc>/<version>

Then do make and make install.

5.Module

Once the package is installed it is recommended to create

module
for the same that package necessary environment variable
loaded and
remove using module command
To load module
module load <modulename>/<version>
For ex: module load fftw/3.3.7
To unload module
module rm <mdoule>/<version>
For Ex: module rm fftw/3.3.7

Suppose if module version is not specified then the

higher version number is chosen automatically.

a. How to create module file

IIT Goa- HPC user manual Page 14 of 17

 Create folder for module
mkdir /home/username/modulefiles
 Create folder under
/home/<username>/modulefiles/<packagename> with
package name
mkdir /home/<username>/modulefileg<packagename>

 then inside the

/home/<username>/modulefiles/<packagename>
create a file with version string with following main contents
cd /home/<username>/modulefiles<packagename>

vim 3.3.7 (version string)

1. First line should be #%Module1.0
2. Section module-what is, prepend-path as following
example

IIT Goa- HPC user manual Page 15 of 17

b. Available Modules

IIT Goa- HPC user manual Page 16 of 17

c. List loaded modules
module list

d. Load a module

module load <modulename>

e. Remove a loaded module

module remove <modulename>

IIT Goa- HPC user manual Page 17 of 17

Powershell High Performance Computing Hpc19 Ps
No ratings yet
Powershell High Performance Computing Hpc19 Ps
1,295 pages
IBM SAN Vol Ctrler, Flashsystem and Storewize Family Command-Line Interface Guide
No ratings yet
IBM SAN Vol Ctrler, Flashsystem and Storewize Family Command-Line Interface Guide
2,378 pages
Exam Questions 2V0-21.23: Vmware Vsphere 8.X Professional
No ratings yet
Exam Questions 2V0-21.23: Vmware Vsphere 8.X Professional
12 pages
Windows Server Storage
100% (1)
Windows Server Storage
1,131 pages
PARAM Ganga User Manual 24-01-2023
No ratings yet
PARAM Ganga User Manual 24-01-2023
106 pages
Dell InsightIQ 4.4.0.0 Administration Guide
No ratings yet
Dell InsightIQ 4.4.0.0 Administration Guide
44 pages
HPC User Manual-Updated
No ratings yet
HPC User Manual-Updated
4 pages
Building Real Time Analytics Applications
No ratings yet
Building Real Time Analytics Applications
36 pages
Cs8711 Cloud Computing Lab Cse
No ratings yet
Cs8711 Cloud Computing Lab Cse
96 pages
Intro To Slurm
No ratings yet
Intro To Slurm
27 pages
FAS Administration Guide
No ratings yet
FAS Administration Guide
199 pages
Commvault HyperSclae X Technology
No ratings yet
Commvault HyperSclae X Technology
20 pages
HPC School - Beginner S2
No ratings yet
HPC School - Beginner S2
54 pages
HPC - Model - Paper
No ratings yet
HPC - Model - Paper
3 pages
Slides
No ratings yet
Slides
33 pages
2022 - HPC Training 04 - HPC Basic Usage
No ratings yet
2022 - HPC Training 04 - HPC Basic Usage
77 pages
Installation and Deployment Guide
No ratings yet
Installation and Deployment Guide
19 pages
Full System Flash
100% (1)
Full System Flash
81 pages
Pavan Sai Kagitha
No ratings yet
Pavan Sai Kagitha
3 pages
HPC Introduction Lecture 2
No ratings yet
HPC Introduction Lecture 2
55 pages
A Comprehensive Survey On 5G-And-Beyond Networks With UAVs Applications Emerging Technologies Regulatory Aspects Research Trends and Challenges
No ratings yet
A Comprehensive Survey On 5G-And-Beyond Networks With UAVs Applications Emerging Technologies Regulatory Aspects Research Trends and Challenges
41 pages
AAC - Overview
No ratings yet
AAC - Overview
39 pages
UNM2000 - Network Convergence Management System (Based On Windows) - Active-Standby System Installation Guide - A
No ratings yet
UNM2000 - Network Convergence Management System (Based On Windows) - Active-Standby System Installation Guide - A
318 pages
Imp Datastage New
100% (1)
Imp Datastage New
158 pages
Introductory Supercomputing PDF
No ratings yet
Introductory Supercomputing PDF
94 pages
04 Qos
No ratings yet
04 Qos
25 pages
Zaratan HPC User Guide
No ratings yet
Zaratan HPC User Guide
4 pages
TorqueAdminGuide 2.5.12
No ratings yet
TorqueAdminGuide 2.5.12
282 pages
19 JobSchedulers
No ratings yet
19 JobSchedulers
37 pages
Slurm Usage Guide
No ratings yet
Slurm Usage Guide
6 pages
Introduction To Einstein HPC Portal-V3
No ratings yet
Introduction To Einstein HPC Portal-V3
2 pages
HPC Cheat Sheet
No ratings yet
HPC Cheat Sheet
1 page
Ansys 2023 R1 - Job Schedulers and Queuing Systems Support
No ratings yet
Ansys 2023 R1 - Job Schedulers and Queuing Systems Support
1 page
Manual For Using Super Computing Resources
No ratings yet
Manual For Using Super Computing Resources
22 pages
Dell Emc Vxrail 47 Os10 Switch CG
No ratings yet
Dell Emc Vxrail 47 Os10 Switch CG
42 pages
Serverservices - Gpu-Cluster (LME - WIKI)
No ratings yet
Serverservices - Gpu-Cluster (LME - WIKI)
4 pages
Understanding Software Engineering Vol 3: Programming Basic Software Functionalities.
From Everand
Understanding Software Engineering Vol 3: Programming Basic Software Functionalities.
Gabriel Clemente
No ratings yet
Pages From Introduction To Einstein HPC Portal-V3-2
No ratings yet
Pages From Introduction To Einstein HPC Portal-V3-2
3 pages
Fortinet FortiGate HA (High Availability)
No ratings yet
Fortinet FortiGate HA (High Availability)
5 pages
Linux Clusters Institute: Scheduling
No ratings yet
Linux Clusters Institute: Scheduling
93 pages
5529 APC Release 9.3.10 Installation and Customization Guide PDF
No ratings yet
5529 APC Release 9.3.10 Installation and Customization Guide PDF
92 pages
Bastion Architecture
No ratings yet
Bastion Architecture
10 pages
Summary
No ratings yet
Summary
2 pages
PARAM - Sanganak User Manual
No ratings yet
PARAM - Sanganak User Manual
81 pages
All My IT Tech Posts
From Everand
All My IT Tech Posts
Stephen Edwards
No ratings yet
PBS-Documentation May17
No ratings yet
PBS-Documentation May17
9 pages
Tutorial HDP Supplement
No ratings yet
Tutorial HDP Supplement
33 pages
Intro HPC Linux Gent
No ratings yet
Intro HPC Linux Gent
124 pages
CCF Usage Manual
No ratings yet
CCF Usage Manual
8 pages
Bunya User Guide 2022 12 06
No ratings yet
Bunya User Guide 2022 12 06
10 pages
67 HMC870 EnhancedPlus GUI
No ratings yet
67 HMC870 EnhancedPlus GUI
74 pages
Calcul Quebec Presentation
No ratings yet
Calcul Quebec Presentation
59 pages
Installation and Upgrade Guide: Access Manager Appliance 4.4
No ratings yet
Installation and Upgrade Guide: Access Manager Appliance 4.4
84 pages
Scheduler Commands Cheatsheet-2020-Ally
No ratings yet
Scheduler Commands Cheatsheet-2020-Ally
1 page
Devops Introd Ction
No ratings yet
Devops Introd Ction
21 pages
Oracle 11g RAC Implementation Guide
100% (1)
Oracle 11g RAC Implementation Guide
38 pages
Vcs Notes 51sp1 Aix
No ratings yet
Vcs Notes 51sp1 Aix
66 pages
Cluster and SVM Peering Express Guide: Ontap 9
No ratings yet
Cluster and SVM Peering Express Guide: Ontap 9
15 pages
3D Cisco Icon Library v2 3 1
No ratings yet
3D Cisco Icon Library v2 3 1
23 pages
Nvidia DGX Pod Data Center Reference Design
No ratings yet
Nvidia DGX Pod Data Center Reference Design
19 pages
Server Cluster Guide For Windows 2003 Server
No ratings yet
Server Cluster Guide For Windows 2003 Server
50 pages
CT 02 Basics Batchsystem
No ratings yet
CT 02 Basics Batchsystem
23 pages
Slurm Talk
No ratings yet
Slurm Talk
40 pages
SciNet Tutorial
No ratings yet
SciNet Tutorial
22 pages
HP StoreVirtual VSA Design and Configuration Guide
No ratings yet
HP StoreVirtual VSA Design and Configuration Guide
46 pages
Kubernetes Made Easy
From Everand
Kubernetes Made Easy
Pankaj Joshi
No ratings yet
How To Make Computers Work For You When You Are Enjoying Life
No ratings yet
How To Make Computers Work For You When You Are Enjoying Life
29 pages
Parallel Programming Using MPI
No ratings yet
Parallel Programming Using MPI
69 pages
HPC Job
No ratings yet
HPC Job
8 pages
Some Very Under Done Instructions For HPC 2013: Hpc@lists - Iitk.ac - in
No ratings yet
Some Very Under Done Instructions For HPC 2013: Hpc@lists - Iitk.ac - in
4 pages
User Guide of High Performance Computing Cluster in School of Physics
No ratings yet
User Guide of High Performance Computing Cluster in School of Physics
8 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
Building A Hyper-V Cluster Using The Microsoft iSCSI Software Target
No ratings yet
Building A Hyper-V Cluster Using The Microsoft iSCSI Software Target
48 pages
Interview Questions for IBM Mainframe Developers
From Everand
Interview Questions for IBM Mainframe Developers
Robert Wingate
1/5 (1)
WebLogic 12c Dynamic Clusters
No ratings yet
WebLogic 12c Dynamic Clusters
8 pages
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
HPC Login Form Iitk Users
No ratings yet
HPC Login Form Iitk Users
2 pages
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
MARIO FRANCO
No ratings yet
Mastering Mesos - Sample Chapter
No ratings yet
Mastering Mesos - Sample Chapter
36 pages
Legion Quick Reference Sheet: Access. Job Script Options. Resource Limits
No ratings yet
Legion Quick Reference Sheet: Access. Job Script Options. Resource Limits
1 page
Network with Practical Labs Configuration: Step by Step configuration of Router and Switch configuration
From Everand
Network with Practical Labs Configuration: Step by Step configuration of Router and Switch configuration
Mulayam Singh
No ratings yet
Network with Practical: ALL PACKET TRACER LABS
From Everand
Network with Practical: ALL PACKET TRACER LABS
MULAYAM SINGH
No ratings yet
CISCO PACKET TRACER LABS: Best practice of configuring or troubleshooting Network
From Everand
CISCO PACKET TRACER LABS: Best practice of configuring or troubleshooting Network
Mulayam Singh
No ratings yet
Sega Saturn Architecture: Architecture of Consoles: A Practical Analysis, #5
From Everand
Sega Saturn Architecture: Architecture of Consoles: A Practical Analysis, #5
Rodrigo Copetti
No ratings yet
Node.js 63 Interview Questions and Answers
From Everand
Node.js 63 Interview Questions and Answers
John Edward Cooper Berg
No ratings yet
First Hop Redundancy Protocol: Network Redundancy Protocol
From Everand
First Hop Redundancy Protocol: Network Redundancy Protocol
Mulayam Singh
No ratings yet
PC Engine / TurboGrafx-16 Architecture: Architecture of Consoles: A Practical Analysis, #16
From Everand
PC Engine / TurboGrafx-16 Architecture: Architecture of Consoles: A Practical Analysis, #16
Rodrigo Copetti
No ratings yet
Cisco CCNA Command Guide: An Introductory Guide for CCNA & Computer Networking Beginners: Computer Networking, #3
From Everand
Cisco CCNA Command Guide: An Introductory Guide for CCNA & Computer Networking Beginners: Computer Networking, #3
Ramon Nastase
4.5/5 (2)
Dreamcast Architecture: Architecture of Consoles: A Practical Analysis, #9
From Everand
Dreamcast Architecture: Architecture of Consoles: A Practical Analysis, #9
Rodrigo Copetti
No ratings yet
Configuration of a Simple Samba File Server, Quota and Schedule Backup
From Everand
Configuration of a Simple Samba File Server, Quota and Schedule Backup
Dr. Hidaia Mahmood Alassouli
No ratings yet

User Manual

Uploaded by

User Manual

Uploaded by

High Performance Computing Facility

Indian Institute of Technology Goa

HPC user manual

Last updated: July 2024

IIT Goa- HPC user manual Page 1 of 17

1. IIT Goa faculty/ researchers/ students are authorized to access this

IIT Goa- HPC user manual Page 2 of 17

IIT Goa- HPC user manual Page 3 of 17

HPC Host name: hpc.iitgoa.ac.in

1. Connecting from Terminal/ Command line.

2. Connect using Putty.

1. Connect to the hpc.iitgoa.ac.in 2. Enter your username and then password

IIT Goa- HPC user manual Page 4 of 17

cd $SLURM SUBMIT DIR

IIT Goa- HPC user manual Page 5 of 17

In this above file change what are in <...> to its

Then submit the myscript.sh as follows :

IIT Goa- HPC user manual Page 6 of 17

Above command will out put the information about the

10131D PARTITION NAME USER ST TIME NODES NODE LIST

In the above output ST means State of the Job in

This command will cancel or delete the job

This command will cancel all the job of user <username>

d. Other scheduler commands

scontrol show jobid <jobid>

This command will show the full details of job which is in

IIT Goa- HPC user manual Page 7 of 17

This command will show the details of the job which

2588 MA00_5LSt+ 09:30:13 02:15.525

scontrol hold <jobid>

This command will release the hold job.

e. Slurm Partition details

This cluster is partitioned with respect to scheduler to use the

IIT Goa- HPC user manual Page 8 of 17

3. Samooh CMS -Job submission portal

Samooh CMS version 1.6 is web based cluster

IIT Goa- HPC user manual Page 9 of 17

Users Dashboard will have Node Usage, Scheduler Partition usage

IIT Goa- HPC user manual Page 10 of 17

c. Scheduler -> Job Scheduler -> Add Job

IIT Goa- HPC user manual Page 11 of 17

Job Template is used for saving frequently submitted jobs

IIT Goa- HPC user manual Page 12 of 17

For Example: FFTW 3.3.7 installed in

IIT Goa- HPC user manual Page 13 of 17

If the packed need cmake to install add -DINSTALL_PREFIX as

Then do make and make install.

Once the package is installed it is recommended to create

Suppose if module version is not specified then the

a. How to create module file

IIT Goa- HPC user manual Page 14 of 17

 then inside the

vim 3.3.7 (version string)

IIT Goa- HPC user manual Page 15 of 17

IIT Goa- HPC user manual Page 16 of 17

module load <modulename>

e. Remove a loaded module

module remove <modulename>

IIT Goa- HPC user manual Page 17 of 17

You might also like