Challenge Remote

remote challenge

Uploaded by

Anas Jamshed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views4 pages

Challenge Remote

remote challenge

Uploaded by

Anas Jamshed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Challenge

Welcome to our ions.bio Data Challenge! All tasks should be analyzed using Python,
which is our language of choice. In total, you should not spend more than 3 hours on
the exercise, although a complete solution may take much longer. There is no right or
wrong answer. For us, it is only relevant to see how you approach such a task, how
creative you may get during the data analysis, and mainly how you communicate the
results. There is no need to polish the code or results. Most important is that you have
fun and enjoy solving the problems!

If you feel that you cannot answer a task due to specific domain knowledge limitations,
feel free to skip this task or send an email to [email protected] for additional information.

Good luck, and we look forward to seeing your innovative solutions and insights!

Tasks:
Task 1: Truck company
Task 2: Mass spectrometer control
Task 1: Truck company
Mega Truck is a logistics company specializing in shipping services, operating with a
fleet of 10 trucks that regularly travel to designated destinations. The roundtrip
distances for these routes are detailed in routes.csv. Over time, you need to gather
data on the routes each truck took. Unfortunately, all records of the trucks' trips were
lost. However, we do know that each truck’s trip consisted of completing 8 routes.

Each truck is fitted with a tracking device that logs the mileage every time the engine
is turned off, creating a new entry. This means that a log entry is recorded each time
a truck returns to the home base. Additional entries may also be logged, such as
during cargo unloading, refueling or bio-breaks.

You can find the mileage logs in logs.csv.

There is a receipt that correspond to one of the trucks. These receipts allowed us to
determine the truck's destinations, as recorded in sample_trip.csv.

1) Can you identify the truck for which we have the recorded routes in the
sample_trip.csv?
2) Can you reconstruct the trips for each of the trucks?
Task 2: Mass spectrometer control
In the past, we analyzed a blood sample with our mass spectrometer and found 10.000
peptides. For each peptide, we acquired its charge, its mass, its fragments, and its
time. In the file library.csv, every row corresponds to a peptide that we measured.
The total measurement time was 120 minutes. After applying some special algorithms,
we could identify the peptides and marked some as important peptides that are
relevant for a certain disease.

Next, we want to measure another blood sample. This time we want to apply special
instrument settings so that whenever we measure one of the important peptides, we
adjust our instrument so that we can measure with maximum performance.

For the sake of simulation, we added the file measurement.csv – which contains all
data points that will be measured during this measurement run. While measuring, we
will get one datapoint after another over the time course of 120 minutes (Note that the
rows are sorted by time). Below is some Python-code that would simulate the
instrument streaming the data. Also note that the values of our recent measurements
are not identical to the expected values due to measurement inaccuracy and noise.
Particularly for the time, we expect some drift over the measurement. Typically there
is a fixed offset between measurements as well as a variable shift over time.

Our goal is now to incorporate some logic (e.g., the function is_important()) that checks
if a measurement is an important one from the library so that we can trigger the do()
function and apply our special instrument settings.

import pandas as pd
library = pd.read_csv('library.csv')
sample = pd.read_csv('measurement.csv')
def instrument_gen():
for i in range(len(sample)):
yield sample.iloc[i]
instrument = instrument_gen()
def do():
pass
def is_important(measurement):
pass
while True:
try:
measurement = next(instrument)
# if the measurement is important do stuff
if is_important(measurement):

do()

except StopIteration:
break

1) Can you come up with a solution that is capable of detecting the important
measurements during the acquisition?
2) The time drift will be potentially key for looking up the measurements in the
library. Can you visualize the observed drift in time?

PDS Lab Manual - 23 Om
No ratings yet
PDS Lab Manual - 23 Om
97 pages
Dsbda Lab Manual
No ratings yet
Dsbda Lab Manual
167 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
85 pages
Pds Leb Manual
No ratings yet
Pds Leb Manual
54 pages
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
100% (1)
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
256 pages
PyRINEX Manuel
No ratings yet
PyRINEX Manuel
14 pages
Curso 2 Data in Out Listas
No ratings yet
Curso 2 Data in Out Listas
30 pages
DSBDA Lab Manual
No ratings yet
DSBDA Lab Manual
167 pages
DS Journal
No ratings yet
DS Journal
46 pages
DS Lab Manual Final
No ratings yet
DS Lab Manual Final
49 pages
Dsbda Lab Manual Merged
No ratings yet
Dsbda Lab Manual Merged
117 pages
CS3362 Data Science Laboratory Manual 2022-23
No ratings yet
CS3362 Data Science Laboratory Manual 2022-23
54 pages
SL-III Lab Manual
No ratings yet
SL-III Lab Manual
74 pages
1152CS239-Intro. To Data Science-Syllabus
No ratings yet
1152CS239-Intro. To Data Science-Syllabus
6 pages
Experiment No: 1 Title:: Creating Vectors and Data Frames and Implementing Data Summary Functions
No ratings yet
Experiment No: 1 Title:: Creating Vectors and Data Frames and Implementing Data Summary Functions
8 pages
Lab Manual Ds&Bdal
No ratings yet
Lab Manual Ds&Bdal
100 pages
04 DS 2023
No ratings yet
04 DS 2023
63 pages
DS Manual-1
No ratings yet
DS Manual-1
29 pages
Homework 4
No ratings yet
Homework 4
7 pages
Mini Project: Hanoi University of Science and Technology EE3490E Fundamentals of Embedded Programming
No ratings yet
Mini Project: Hanoi University of Science and Technology EE3490E Fundamentals of Embedded Programming
5 pages
Aids - 21ad62 - Datascience Lab Manual-1
No ratings yet
Aids - 21ad62 - Datascience Lab Manual-1
15 pages
Data Science Practicals
No ratings yet
Data Science Practicals
40 pages
Mini Project 20232 en
No ratings yet
Mini Project 20232 en
7 pages
ML Lab Manual 2024
No ratings yet
ML Lab Manual 2024
41 pages
PRACTICAL QUESTIONS For DSBDA
No ratings yet
PRACTICAL QUESTIONS For DSBDA
9 pages
Python Programming Lab Manual
No ratings yet
Python Programming Lab Manual
52 pages
Sessional QP-TaT
No ratings yet
Sessional QP-TaT
5 pages
DSBDAlab Manual
No ratings yet
DSBDAlab Manual
116 pages
Data Science Lab Group Submission
No ratings yet
Data Science Lab Group Submission
13 pages
Data Science Lab Group Submission
No ratings yet
Data Science Lab Group Submission
13 pages
PDS Exp 1 To 3
No ratings yet
PDS Exp 1 To 3
17 pages
Exercise1 Problem
No ratings yet
Exercise1 Problem
2 pages
PPT2
No ratings yet
PPT2
14 pages
Data Science and Its Applications (21AD62) Lab Manual
No ratings yet
Data Science and Its Applications (21AD62) Lab Manual
26 pages
Some Exercises
No ratings yet
Some Exercises
9 pages
DSBDA Manual
No ratings yet
DSBDA Manual
76 pages
DBDAL LAB - MANUAL - Final
No ratings yet
DBDAL LAB - MANUAL - Final
93 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
MP 1
No ratings yet
MP 1
2 pages
Lab 9
No ratings yet
Lab 9
2 pages
Data Science in Society Cat
No ratings yet
Data Science in Society Cat
5 pages
DSV Manual Final
No ratings yet
DSV Manual Final
47 pages
Python For Data Sceince l1 Hands On
No ratings yet
Python For Data Sceince l1 Hands On
5 pages
Assignment
No ratings yet
Assignment
3 pages
DATASCIENCE
No ratings yet
DATASCIENCE
3 pages
Mini Project 20242 en Rev1.1
No ratings yet
Mini Project 20242 en Rev1.1
10 pages
Data Analytics QP May 25
No ratings yet
Data Analytics QP May 25
4 pages
Proj2425 en
No ratings yet
Proj2425 en
6 pages
M3 Instructions
No ratings yet
M3 Instructions
6 pages
Web Data Analytics: Instructions
No ratings yet
Web Data Analytics: Instructions
2 pages
Micron AI Competition
No ratings yet
Micron AI Competition
4 pages
IDS Syllabus
No ratings yet
IDS Syllabus
5 pages
Safety Inspection Checklist: A. General-All Items
No ratings yet
Safety Inspection Checklist: A. General-All Items
5 pages
Design of Box Culvert
No ratings yet
Design of Box Culvert
11 pages
Instructions To Authors
50% (2)
Instructions To Authors
4 pages
Cathodic Protection
No ratings yet
Cathodic Protection
132 pages
Pulverizer Plant
No ratings yet
Pulverizer Plant
35 pages
(400t) Demag HC 1010
100% (1)
(400t) Demag HC 1010
26 pages
Amine Absorber
No ratings yet
Amine Absorber
4 pages
Deposit Reciept
100% (1)
Deposit Reciept
2 pages
Film and Television Institute of India, Pune Television Wing Syllabus & Detailed Curriculum of
No ratings yet
Film and Television Institute of India, Pune Television Wing Syllabus & Detailed Curriculum of
28 pages
300+ TOP CABLES Objective Type Questions and Answers Electrical Engineering Multiple Choice Questions
No ratings yet
300+ TOP CABLES Objective Type Questions and Answers Electrical Engineering Multiple Choice Questions
16 pages
Apar Notes
No ratings yet
Apar Notes
7 pages
A JIT Translator For Oberon
No ratings yet
A JIT Translator For Oberon
57 pages
Dutch Barge For Sale - 69ft X 14ft Tjalk
100% (1)
Dutch Barge For Sale - 69ft X 14ft Tjalk
2 pages
Cassette Tape
No ratings yet
Cassette Tape
12 pages
Talend Questions
No ratings yet
Talend Questions
4 pages
FRP0104 CLT Tieback Packer Procedure
No ratings yet
FRP0104 CLT Tieback Packer Procedure
3 pages
BMU Clarification
No ratings yet
BMU Clarification
8 pages
Compass Module Application Note PDF
No ratings yet
Compass Module Application Note PDF
8 pages
Chapter 2 Review Questions
No ratings yet
Chapter 2 Review Questions
7 pages
NAGA CITY Land Use Information
No ratings yet
NAGA CITY Land Use Information
7 pages
Acoustical Surfaces FABRISORB Brochure
No ratings yet
Acoustical Surfaces FABRISORB Brochure
16 pages
Annual Review 2013 2014
No ratings yet
Annual Review 2013 2014
208 pages
Explanation of Data
No ratings yet
Explanation of Data
3 pages
Article - Feature of Determination - Eng
No ratings yet
Article - Feature of Determination - Eng
8 pages
Text Mining
No ratings yet
Text Mining
13 pages
Requirements For Occupation Certificates For New Buildings
100% (1)
Requirements For Occupation Certificates For New Buildings
1 page
9D Research Group
No ratings yet
9D Research Group
9 pages
Company Overview: Initial Report April 21st, 2008
100% (1)
Company Overview: Initial Report April 21st, 2008
16 pages
Bioinformatics With NGS - Analysis
No ratings yet
Bioinformatics With NGS - Analysis
6 pages
Dokumen - Tips 9702 p1 Superpositionallcompleted
No ratings yet
Dokumen - Tips 9702 p1 Superpositionallcompleted
42 pages
From Scratch: Writing Your Own Functions
No ratings yet
From Scratch: Writing Your Own Functions
15 pages
Toyota Had A Major Problem With Unexplained Acceleratioon
No ratings yet
Toyota Had A Major Problem With Unexplained Acceleratioon
3 pages
CPAR Reviewer PDF
No ratings yet
CPAR Reviewer PDF
4 pages
Overview of Protein Structure
No ratings yet
Overview of Protein Structure
3 pages
1-Kickoff Meeting Template PDF
No ratings yet
1-Kickoff Meeting Template PDF
7 pages
Paul Pro
No ratings yet
Paul Pro
10 pages
OS-T - 1100 Thermal Stress Analysis of A Printed Circuit Board With Anisotropic Material Properties
No ratings yet
OS-T - 1100 Thermal Stress Analysis of A Printed Circuit Board With Anisotropic Material Properties
8 pages
Problem Statement
No ratings yet
Problem Statement
4 pages
Establishing Causality Is Difficult, Whether Conclusion...
No ratings yet
Establishing Causality Is Difficult, Whether Conclusion...
3 pages
Jeetender Joshi (BD)
No ratings yet
Jeetender Joshi (BD)
3 pages
Python Data Science Cookbook
From Everand
Python Data Science Cookbook
Taryn Voska
No ratings yet
Python Data Science Cookbook: Practical solutions across fast data cleaning, processing, and machine learning workflows with pandas, NumPy, and scikit-learn
From Everand
Python Data Science Cookbook: Practical solutions across fast data cleaning, processing, and machine learning workflows with pandas, NumPy, and scikit-learn
Taryn Voska
No ratings yet
Collection of Raspberry Pi Projects
From Everand
Collection of Raspberry Pi Projects
Guillermo Perez Guillen
5/5 (1)
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Useful Python
From Everand
Useful Python
Stuart Langridge
No ratings yet
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Programming Concepts in C++
From Everand
Programming Concepts in C++
Robert Burns
No ratings yet
Python: Advanced Guide to Programming Code with Python
From Everand
Python: Advanced Guide to Programming Code with Python
Charlie Masterson
No ratings yet

Challenge Remote

Uploaded by

Challenge Remote

Uploaded by

Challenge

You can find the mileage logs in logs.csv.

You might also like