
The Python Programming Language and HDF5: H5py.

Application to Satellite Remote Sensing Data Processing.

Needs of satellite remote sensing data processing and programming languages.
Introduction to some Python elements.
Introduction to Numerical Python with Numpy.
Tying them together with H5py.
Examples.
Daniel Kahn, Science Systems and Applications Inc.
HDF/HDF-EOS Workshop XIII, Riverdale, MD
3 Nov 2009
Where credit is due:

Andrew Collette, UCLA, author of h5py

Francesc Alted, author of PyTables and the HDF5 1.6 Pyrex definitions (i.e. the Python-C definitions used in h5py)

Andrew also requested acknowledgment for Darren Dale (Cornell) and Armando Sole (ESRF) for their suggestions, patches and encouragement.
Modern remote sensing satellite data and data processing
needs have several characteristics.
1) The data need to be subjected to numerical algorithms. The
algorithms vary in their sophistication but few, if any, require
supercomputer architectures. E.g. Computing a zonal mean.
(Fortran, C, C++)
2) The data need to be subjected to non-numerical “bookkeeping”
algorithms. Associated metadata needs to be read, matched, sliced,
diced, acted upon and committed to databases. (Perl)
3) Modern systems automatically schedule the processing onto
lots of inexpensive, commodity CPUs. This complicates using
managed (i.e. licensed) commercial runtime environments, e.g.
IDL, Matlab, Mathematica.
4) Satellite processing systems are dynamic. They need
customization several times over the course of each instrument's
life. Development time needs to be short. (Perl, IDL, Matlab,
Mathematica)
5) Data processing systems need to track data provenance.
A Remote Sensing Data Processing Challenge...

Problem:
We lack a programming language which spans the breadth of
our remote sensing data processing needs, i.e. one which can
handle both numerical and non-numerical processing
(including HDF5) and that has a short development cycle.

Goal:
Find a programming language which allows fast development
and is adept at numerical processing as well as non-numerical
processing. “Adept” doesn't mean “best”, just pretty good.
...and the Python Programming Language
Why Python fits the bill better than:
Fortran
As an interpreted language with dynamic variable bindings,
Python offers more flexibility during development. The
multidimensional array extension (numpy) provides efficient and
competitive array operations. It is more similar to IDL or
Matlab...
IDL or Matlab
No license is required, which can save money on several levels.
The Python language is better suited to non-numerical
data processing tasks such as those encountered in production
processing of remote sensing data. It is more similar to Perl...
Perl/PDL
Nicer syntax for array operations.
More complete HDF5 support (h5py vs PDL HDF5).
Some elements of Python
Python is an interpreted language. Python
statements can easily be run from an interactive
prompt. Python programs are written in ASCII files.

Python has many of the same elements as programming languages
used for numerical processing (i.e. loops, conditionals, etc.) plus...
Name spaces

Lists

Dictionaries
Python has name spaces
Name spaces allow related functions and data to be grouped together
in a way that will not conflict with unrelated functions and data which
happen to be named the same.
Name spaces are very useful, but are only relevant here because
Python name spaces correspond to Python modules. The
numerical and HDF5 routines we will see later are implemented
as Python modules.
For example, to open an HDF5 file in Python we first import the h5py
module and then we open the file using the File function of the h5py
module.
import h5py # First import the module.
FileID = h5py.File('hdf5_test.h5','r') # Now use the File function;
                                       # note the h5py module prefix.
...

We could import other modules simultaneously which have an unrelated function called File and there would be no conflict or error.
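For illustration, here is a minimal sketch of that point; the second module name is hypothetical and is assumed only to define its own, unrelated File function:

import h5py
import mymod   # hypothetical module which also defines a File function

fid = h5py.File('hdf5_test.h5','r')   # h5py's File
other = mymod.File('other.dat')       # the unrelated File; no conflict, because
                                      # each call is qualified by its module name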
Python has Lists:
These are 1D arrays which are useful for solving many problems.
List index notation looks like 1D array notation in many languages.

>>> MyList = [1.0,70] # initialize variable to 2 element list
>>> MyList.append("Demo Data!") # Append an element, NOT like fortran
>>> MyList.append(MyList[0] + MyList[1]) # Append sum of first 2 elements
>>> print MyList
[1.0, 70, 'Demo Data!', 71.0]

Python has Dictionaries:
(Aka hash tables) These map symbols to objects in analogy to
how a 1D list maps an integer index value to an object.

>>> MyDictionary = {} # Initialize empty Dictionary
>>> MyDictionary[0] = 1
>>> MyDictionary['Gadzooks'] = 70
>>> MyDictionary['Kind of Data'] = 'Demo Data!'
>>> MyDictionary['Result'] = MyDictionary[0] + MyDictionary['Gadzooks']
>>> print MyDictionary
{0: 1, 'Kind of Data': 'Demo Data!', 'Result': 71, 'Gadzooks': 70}
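One more small sketch (not from the original slides): a Dictionary can also be iterated over, a pattern the aggregation example later relies on:

>>> for (Key, Value) in MyDictionary.items():
...     print Key, Value
...
0 1
Kind of Data Demo Data!
Result 71
Gadzooks 70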
Python Lists vs Python Dictionaries vs Arrays in Fortran

Index to a List must be an integer, but a Dictionary can be
indexed by an integer or string.

MyList[Integer] vs. MyDictionary[Integer or String]

The objects referenced by a List or Dictionary can be any Python
object. This is in contrast to more traditional languages, e.g. the
elements of a Fortran array must be of a particular type such as
real*4.

MyList = [34,"String Datum"]
or
MyDictionary = {'FirstKey':34,'SecondKey':"String Datum"}
The List and Dictionary data structures make it possible to write
flexible programs very quickly. However, they are not good for
numerical computation. The variety of objects (number, string, etc.)
which they can reference makes computation slow.

For example, when adding array elements: MyList[0] + MyList[1],
Python must check at run time whether MyList[0] and MyList[1] are
objects that can be added together, and not, say, a number and a
string. In a loop this check is performed at every iteration!
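As a minimal illustration of that run-time check, the same addition can succeed or fail depending on what the list happens to hold at the moment it runs:

>>> MyList = [1.0, 70]
>>> MyList[0] + MyList[1]      # both elements are numbers, so the add succeeds
71.0
>>> MyList[1] = "Demo Data!"   # rebind the second element to a string
>>> MyList[0] + MyList[1]      # same expression, but now the check fails
Traceback (most recent call last):
  ...
TypeError: unsupported operand type(s) for +: 'float' and 'str'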

What to do?

Enter Numpy
Numpy is a package for use with Python which provides
multidimensional arrays of numeric (and other) types and extends
Python syntax to provide a familiar interface to the arrays.

Numpy extends Python syntax to allow the expression of vector
operations similar to those in IDL, Matlab, or Fortran (>=90).
Numpy Example: Create, Print, and add two 2D arrays

Build from Python lists of lists of elements (the module name is numpy)...
>>> a = numpy.array([[1,2,3],[4,5,6]])
>>> print a
[[1 2 3]
[4 5 6]]
Build from dimension sizes...
>>> b = numpy.ones([2,3])
>>> print b
[[ 1. 1. 1.]
[ 1. 1. 1.]]
Print a selected element...
>>> print a[1,2]
6

Example: Add the two 2D arrays.
>>> print a+b
[[ 2. 3. 4.]
[ 5. 6. 7.]]
>>>
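As a small, illustrative sketch of the kind of numerical algorithm mentioned earlier (a zonal mean), numpy can average over the longitude axis of a latitude-by-longitude grid in a single call; the grid below is synthetic demonstration data, not a real data product:

>>> import numpy
>>> field = numpy.random.random([180, 360])   # synthetic latitude x longitude grid
>>> ZonalMean = field.mean(axis=1)            # average over the longitude axis
>>> ZonalMean.shape
(180,)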
The HDF5 connection: H5py

H5py is a Python-HDF5 interface, a Python module
written by Andrew Collette. Its design allows the use of
familiar Python structures for working with HDF5 files.

The interface maps Python syntax for familiar Python
objects to similar objects in the HDF5 file.
Here is a familiar example HDF5 file
from the HDFView distribution:
Here is how to read the 3D int array using
h5py.

>>> import h5py
>>> fid = h5py.File('hdf5_test.h5','r')
>>> group = fid['arrays']
>>> The3DArray = group['3D int array'].value
>>> fid.close()
>>> The3DArray
array([[[ 174,  27,   0, ..., 102,  71, 100009],
        [ 171,  27,   0, ..., 194,  79, 100109],
        [ 172,  27,   0, ..., 102,  55, 100209],
        ...,
Equivalence of HDF5 Groups and Python Dictionaries
Print value of dictionary entry:

>>> MyDictionary = {'RandArray':numpy.random.random([2,2])}
>>> print MyDictionary['RandArray']
[[ 0.82066938 0.39219683]
[ 0.86546443 0.91276533]]

Print value of HDF5 file entry:

>>> fid = h5py.File('RandomArray.h5','r')
>>> print fid['RandArray'].value
[[ 0.1 3.14152908]
[ 2.71828008 0. ]]
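For completeness, here is a minimal sketch (not part of the original slides) of how a file like RandomArray.h5 could have been written, using the same dictionary-style assignment syntax:

>>> import h5py, numpy
>>> fid = h5py.File('RandomArray.h5','w')
>>> fid['RandArray'] = numpy.random.random([2,2])   # assign an array, just like a dictionary entry
>>> fid.close()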
Simple Real world example:

Goal: Retrieve HDF5 file from Configuration Management (CM)
and insert CM metadata into HDF5 file.

We want to place the CM path and revision number inside the
HDF5 file as an attribute.
Python script to retrieve file from CM and store Rev
number as attribute.
#! /usr/bin/env python

import sys
import os
import h5py

Rev = sys.argv[1]         # Specify revision number on command line
SVNFilepath = sys.argv[2] # Specify CM path on command line

command = 'svn export -r ' + Rev + ' ' + SVNFilepath # Subversion command
InStream = os.popen(command,'r')
ExportString = InStream.read()
ExportReturnCode = InStream.close()
Elements = SVNFilepath.split('/')

# HDF5 code

fid = h5py.File(Elements[-1]) # Elements[-1] is file name

fid.attrs['SVN Path and Revision'] = SVNFilepath + '@' + Rev

fid.close()

The h5py code is the import plus the last three statements. Note the minimal effort needed for the HDF5 calls.
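A hypothetical invocation of the script (the script file name and repository URL are made up for illustration):

$ ./insert_cm_metadata.py 1234 https://svn.example.com/repo/trunk/Data/granule.h5

After the export, granule.h5 in the current directory carries the 'SVN Path and Revision' attribute, i.e. the repository path followed by '@1234'.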
Another real world example (if NPOESS is your real world).

We have had several occasions to do data aggregation on HDF5 files for the
OMPS Limb instrument.

Our retrieval code (Fortran) processes an orbit of data as 480 distinct pieces
and places the results into 480 distinct HDF5 files. We wish to aggregate the
files such that an N dimensional array in an unaggregated file becomes an N+1
dimensional array in the aggregated HDF5 file.

The Fortran code is (mostly) the product of the retrieval scientist, while the
aggregation is a requirement of the production data processing system. It makes
sense to aggregate as a post-processing step to the Fortran code so as to
minimize changes to the original Fortran code.
Aggregation Algorithm

1) Input is list of HDF5 files


2) Analyze structure of one file to generate list of fully qualified dataset names,
dimensions, and type (In the code I use a Python Dictionary and not a list).
3) Assume all files have that structure. Read corresponding datasets (of dim N)
from each file into aggregation variable (of dim N+1).
4) After corresponding datasets have been read from all files write aggregation
variable to HDF5 file.
5) Repeat steps 3 and 4 until all datasets have been aggregated.

[Schematic of Array Aggregation for One Dataset: the corresponding arrays from File 1, File 2, File 3, and File 4 are stacked along a new dimension to form the aggregated array.]


# Data Aggregator. This script takes a set of HDF5 files as input.
# The files are expected to have identical structure. All the
# corresponding arrays in the input files are combined into an array
# which has dimensions N+1, where N is the number of dimensions in the
# original, constituent arrays.

import sys
import h5py
import numpy

Files = sys.argv[1:] # Get file names from command line

# First step is to select one file and create a map of the shape and
# data type of all datasets. This is naturally done via a recursive
# function, called VisitAllObjects
FirstFid = h5py.File(Files[0],'r') # Open first HDF5 file

FileInfo = {} # Initialize FileInfo to be a dictionary. We will use it to build a mapping from
              # dataset name to a tuple containing shape and type of dataset.
# EVALUATE HDF5 HIERARCHY
# Evaluating a hierarchy is naturally a recursive process so we define a function....
def VisitAllObjects(Group,Path):
    for i in Group.items():
        if isinstance(i[1],h5py.Group):
            VisitAllObjects(i[1],Path + '/' + i[0])
        else:
            DatasetName = Path + '/' + i[0]
            FileInfo[DatasetName] = (Group[DatasetName].shape, Group[DatasetName].dtype)

VisitAllObjects(FirstFid,'')
FirstFid.close()
# Print dataset paths and info to screen
for (k,v) in FileInfo.items():
    print k,v
# AGGREGATE DATA
# Now that we understand the file structure we can perform the aggregation.
OutputFileID = h5py.File('AggregatedData.h5','w')
NumberOfFiles = len(Files)

# Here is the meat of the code. The outer loop is over datasets, the inner
# over all files.
for Dataset in FileInfo.keys():
    AggregatedData = numpy.ndarray(FileInfo[Dataset][0]+(NumberOfFiles,),
                                   dtype=FileInfo[Dataset][1])
    for FileNumber in range(NumberOfFiles):
        # Open file, read data into aggregation array, and close
        fid = h5py.File(Files[FileNumber],'r')
        AggregatedData[...,FileNumber] = fid[Dataset].value
        fid.close()
    # Make sure the parent group exists in the output file before creating the dataset.
    Path = Dataset.split('/')
    if len(Path) > 2:
        OutputFileID.require_group('/'.join(Path[:-1]))
    #OutputFileID[Dataset] = AggregatedData
    OutputFileID.create_dataset(Dataset, data=AggregatedData, compression=5,
                                chunks=FileInfo[Dataset][0]+(1,))

OutputFileID.close()
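A hypothetical invocation (the script file name is made up; the input file names follow the pattern shown on the next slides):

$ python aggregate.py Data/OMPS_LP_SDR_*.h5

The script writes AggregatedData.h5 in the current directory.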
Original, Unaggregated Data Field (note the 3-dimensional dataspace):
$ h5dump -H -d /ANCILLARY_DATA/GeopotentialHeight_NCEP \
Data/OMPS_LP_SDR_20041121_55_146_-69_-119.h5
HDF5 "Data/OMPS_LP_SDR_20041121_55_146_-69_-119.h5" {
DATASET "/ANCILLARY_DATA/GeopotentialHeight_NCEP" {
DATATYPE H5T_IEEE_F32LE
DATASPACE SIMPLE { ( 5, 3, 21 ) / ( 5, 3, 21 ) }
ATTRIBUTE "Title" {
DATATYPE H5T_STRING {
STRSIZE 25;
}
DATASPACE SCALAR
}
ATTRIBUTE "Units" {
DATATYPE H5T_STRING {
STRSIZE 2;
}
DATASPACE SCALAR
}
ATTRIBUTE "_FillValue" {
DATATYPE H5T_IEEE_F32LE
DATASPACE SIMPLE { ( 1 ) / ( 1 ) }
}
}
Aggregated Data Field

$ h5dump -H -d /ANCILLARY_DATA/GeopotentialHeight_NCEP \
AggregatedData.h5
HDF5 "AggregatedData.h5" {
DATASET "/ANCILLARY_DATA/GeopotentialHeight_NCEP" {
DATATYPE H5T_IEEE_F32LE
DATASPACE SIMPLE { ( 5, 3, 21, 4 ) / ( 5, 3, 21, 4 ) }
}
}

Now we have 4 dimensions. The new dimension has extent 4,
corresponding to the number of input files.

Note that none of the attributes were copied.
This is a bug, but easily fixed.
To fix the bug, I assume attributes (like "Units") are not
aggregated, and I take attribute names and values from the first file.

The new code consists of only 2+ additional lines, shown below.
def VisitAllObjects(Group,Path):
    for i in Group.items():
        if isinstance(i[1],h5py.Group):
            VisitAllObjects(i[1],Path + '/' + i[0])
        else:
            DatasetName = Path + '/' + i[0]
            FileInfo[DatasetName] = (Group[DatasetName].shape,
                                     Group[DatasetName].dtype,
                                     Group[DatasetName].attrs.listitems())

And also....
DS = OutputFileID.create_dataset(Dataset, data=AggregatedData, compression=5,
                                 chunks=FileInfo[Dataset][0]+(1,))
[DS.attrs.__setitem__(Attribute[0], Attribute[1]) for Attribute in FileInfo[Dataset][2]]
Fixed output (now we have attributes):
$ h5dump -H -d /ANCILLARY_DATA/GeopotentialHeight_NCEP \
AggregatedDataWithAttributes.h5
HDF5 "AggregatedDataWithAttributes.h5" {
DATASET "/ANCILLARY_DATA/GeopotentialHeight_NCEP" {
DATATYPE H5T_IEEE_F32LE
DATASPACE SIMPLE { ( 5, 3, 21, 4 ) / ( 5, 3, 21, 4 ) }
ATTRIBUTE "Title" {
DATATYPE H5T_STRING {
STRSIZE 25;
CTYPE H5T_C_S1;
}
DATASPACE SCALAR
}
ATTRIBUTE "Units" {
DATATYPE H5T_STRING {
STRSIZE 2;
CTYPE H5T_C_S1;
}
DATASPACE SCALAR
}
ATTRIBUTE "_FillValue" {
DATATYPE H5T_IEEE_F32LE
DATASPACE SIMPLE { ( 1 ) / ( 1 ) }
}
}
}
Summary:
Python offers a high degree of flexibility for code
development; combined with the ability to do easy
text, numerical array and HDF5 coding, this makes it a
good candidate for solving problems in satellite
remote sensing data processing.

Few, if any, other computer languages offer this
combination of benefits, making Python uniquely suited for
this task.
Future Work:
Tracking module version provenance is likely one of the
outstanding questions for Python use in production.
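One possible, illustrative approach (not from the original talk) is to record the interpreter and module versions as attributes on each output file:

import sys
import h5py
import numpy

fid = h5py.File('AggregatedData.h5','a')              # open an existing output file for update
fid.attrs['Python Version'] = sys.version.split()[0]  # e.g. '2.5.2'
fid.attrs['h5py Version'] = h5py.version.version
fid.attrs['numpy Version'] = numpy.version.version
fid.close()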

Acknowledgment:
Curt Tilmes at NASA Goddard funded this work via
contract NNG06HX18C task 614.5-01-07
