
Capstone Reading Format

This research examines using Hadoop and MapReduce for near real-time processing of proteomics data. The researchers investigated converting raw mass spectrometer data files into XML format and using MapReduce tasks for 2D and 3D peak picking. The results showed MapReduce was well-suited for this type of data processing and distribution. Future work could focus on additional proteomics analysis techniques within this framework.

Uploaded by

Jonbert Andam

Flordeline A. Cadeliña
IT Research Methods
October 22, 2016
Research Summary


Revised 2.2
Title: NEAR REAL-TIME PROCESSING OF PROTEOMICS DATA USING HADOOP
by: Chris Hillman, Yasmeen Ahmad, Mark Whitehorn, and Andy Cobley

Area Context: Near real-time processing solution using MapReduce and Hadoop.

Type of Research: Qualitative Research

# of Data Examined: 4 (Hadoop, Java Code, MapReduce, 2D and 3D peaks)

Method(s) Used: Investigation using the technique from the big data. Data processing on mass spectrometer as raw file in vendor binary format that will convert files into XML and mzML format.

Theoretical Foundation: XML file conversion, 2D peak picking, de-isotoping 2D peaks, 3D peak picking, and 3D isotopic envelopes

Problem/Issue being addressed: Solution on data management and processing facing the life sciences community on data that is preprocessed before any biological insight.

Results of the Study: The 2D and 3D peak-picking process fits very well into the MapReduce programming framework. Data to be redistributed on a dataset that has been greatly reduced by the map task.

Additional Knowledge/Insights: There are many areas still to be researched in this process, including the SILAC pair/triplet detection and, importantly, the database search that identifies the peptides by their mass and ties the peptides to a given protein. The process coded in the MapReduce framework will allow timings to be taken and compared across platforms and Hadoop configurations.

Recommendation: A properly designed and researched process will allow future work to take advantage of technical developments without having to revalidate and redesign the methodology for processing raw mass spectrometer data into actionable information.
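
The Results entry notes that peak picking fits MapReduce because the map task greatly reduces the data before it is redistributed. The sketch below illustrates that idea only; it is not the authors' code. It assumes a simple local-maximum rule for 2D peak picking, a fixed m/z bin width of 0.1 as the shuffle key, and a reduce step that merely summarises each bin across scans as a stand-in for 3D peak picking. The function names and the in-memory shuffle are hypothetical and stand in for what Hadoop would do across nodes.

```python
from collections import defaultdict


def map_pick_peaks(scan_id, spectrum, min_intensity=0.0):
    """Map step (assumed rule): keep only local intensity maxima in one scan.

    spectrum is a list of (mz, intensity) pairs sorted by mz.
    Most points are dropped here, so far less data reaches the shuffle.
    """
    for i in range(1, len(spectrum) - 1):
        mz, intensity = spectrum[i]
        left = spectrum[i - 1][1]
        right = spectrum[i + 1][1]
        if intensity > min_intensity and left < intensity >= right:
            # Key by m/z rounded to 0.1 (an assumed bin width) so the
            # same peak seen in different scans lands in the same group.
            yield round(mz, 1), (scan_id, mz, intensity)


def reduce_across_scans(mz_key, values):
    """Reduce step (stand-in for 3D peak picking): summarise one m/z bin."""
    values = sorted(values)  # order by scan id
    apex = max(values, key=lambda v: v[2])  # most intense observation
    return mz_key, {
        "scans": [v[0] for v in values],
        "apex_scan": apex[0],
        "apex_intensity": apex[2],
    }


def run(scans):
    """Simulate map -> shuffle -> reduce in memory for a dict of scans."""
    shuffled = defaultdict(list)
    for scan_id, spectrum in scans.items():
        for key, value in map_pick_peaks(scan_id, spectrum):
            shuffled[key].append(value)
    return dict(reduce_across_scans(k, v) for k, v in sorted(shuffled.items()))


if __name__ == "__main__":
    example = {
        1: [(500.0, 1.0), (500.1, 5.0), (500.2, 1.0), (600.0, 2.0)],
        2: [(500.0, 2.0), (500.1, 9.0), (500.2, 3.0), (600.0, 1.0)],
    }
    print(run(example))
```

In this toy input, eight raw points go into the map step and only two picked peaks are shuffled, which is the data reduction the summary refers to. A fixed-width bin can split a peak that sits on a bin edge, so a real pipeline would need a more careful grouping rule.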
