
The project developed a Python command-line tool to fetch research papers from PubMed, focusing on studies by authors affiliated with pharmaceutical or biotech companies. It utilized the PubMed API to extract key metadata and structured the results in a CSV file for analysis. The tool is designed for modularity and maintainability, with plans for future enhancements including improved affiliation detection and advanced filtering options.


# Research Paper Fetcher: Approach, Methodology, and Results

## **1. Introduction**

The objective of this project was to develop a Python-based **command-line tool**
to fetch research papers from **PubMed**, focusing on identifying studies authored
by researchers affiliated with **pharmaceutical or biotech companies**. The results
were structured and exported into a CSV file for further analysis.

## **2. Approach**

To ensure modularity and maintainability, the project followed a structured
workflow:

1. **Fetching Research Papers**: Accessing the **PubMed API** based on user queries.
2. **Filtering Non-Academic Authors**: Identifying researchers affiliated with
**pharmaceutical or biotech companies**.
3. **Data Extraction**: Collecting key information like **title, authors,
affiliations, and corresponding author email**.
4. **Exporting Data**: Saving results in a **CSV file**.
5. **Command-Line Interface (CLI)**: Providing user-friendly interaction.
6. **Packaging with Poetry**: Ensuring the tool is well-structured and easily
installable.
7. **Publishing to TestPyPI**: Making the package publicly accessible for testing.

## **3. Methodology**

### 3.1 Fetching Data from PubMed

- Used the **PubMed API** to fetch research articles.
- Extracted critical metadata:
  - **PubmedID**
  - **Title**
  - **Publication Date**
  - **Authors & Affiliations**
  - **Corresponding Author Email**
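
The extraction step can be sketched as follows. This is a minimal illustration, not the project's actual code: the report only says "PubMed API", so the use of the NCBI E-utilities endpoints (`esearch.fcgi` and `efetch.fcgi`), the function names, the record layout, and the made-up `SAMPLE_XML` record are all assumptions. Treating the first email found in an affiliation string as the corresponding author's email is likewise an assumed heuristic.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Assumed base URL: NCBI E-utilities (the report only says "PubMed API").
EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

# Made-up record, trimmed to the elements this sketch reads.
SAMPLE_XML = """
<PubmedArticleSet>
  <PubmedArticle>
    <MedlineCitation>
      <PMID>12345678</PMID>
      <Article>
        <Journal><JournalIssue><PubDate>
          <Year>2023</Year><Month>Aug</Month><Day>15</Day>
        </PubDate></JournalIssue></Journal>
        <ArticleTitle>Example vaccine study</ArticleTitle>
        <AuthorList>
          <Author>
            <LastName>Doe</LastName><ForeName>Jane</ForeName>
            <AffiliationInfo>
              <Affiliation>Example Pharma Inc, Boston, USA. jane.doe@example.com.</Affiliation>
            </AffiliationInfo>
          </Author>
        </AuthorList>
      </Article>
    </MedlineCitation>
  </PubmedArticle>
</PubmedArticleSet>
"""


def esearch_url(query: str, retmax: int = 20) -> str:
    """Build the URL that returns PubMed IDs matching a query."""
    params = {"db": "pubmed", "term": query, "retmax": retmax, "retmode": "json"}
    return f"{EUTILS}/esearch.fcgi?{urlencode(params)}"


def efetch_url(pmids: list[str]) -> str:
    """Build the URL that returns full records (as XML) for the given IDs."""
    params = {"db": "pubmed", "id": ",".join(pmids), "retmode": "xml"}
    return f"{EUTILS}/efetch.fcgi?{urlencode(params)}"


def parse_articles(xml_text: str) -> list[dict]:
    """Extract ID, title, date, authors, and an email from efetch-style XML."""
    records = []
    for article in ET.fromstring(xml_text).iter("PubmedArticle"):
        title_el = article.find(".//ArticleTitle")
        date_parts = [article.findtext(f".//PubDate/{tag}") for tag in ("Year", "Month", "Day")]
        authors, email = [], None
        for author in article.iter("Author"):
            names = (author.findtext("ForeName"), author.findtext("LastName"))
            affiliations = [a.text or "" for a in author.iter("Affiliation")]
            authors.append({"name": " ".join(n for n in names if n), "affiliations": affiliations})
            # Assumed heuristic: keep the first email seen in any affiliation string.
            for text in affiliations:
                found = EMAIL_RE.search(text)
                if found and email is None:
                    email = found.group(0)
        records.append({
            "pubmed_id": article.findtext(".//PMID"),
            "title": "".join(title_el.itertext()).strip() if title_el is not None else "",
            "publication_date": "-".join(p for p in date_parts if p),
            "authors": authors,
            "email": email,
        })
    return records
```

Calling `parse_articles(SAMPLE_XML)` yields one record whose `authors` list carries each author's affiliation strings. The sketch keeps URL building and XML parsing separate from the network call, so the parsing logic can be unit-tested without contacting PubMed.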

### 3.2 Filtering Industry-Affiliated Authors

- Identified **non-academic affiliations** based on keywords:
  - "Pharmaceutical"
  - "Biotech"
  - Specific companies (e.g., Pfizer, Moderna, Johnson & Johnson)
- Extracted **author names and their respective company affiliations**.
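
A keyword filter of this kind might look like the sketch below. The keyword list comes from the bullets above; the function names and the input shape (a list of dicts with `name` and `affiliations` keys) are hypothetical and chosen only for illustration.

```python
# Keywords and example companies named in the report; extend as needed.
INDUSTRY_KEYWORDS = (
    "pharmaceutical",
    "biotech",
    "pfizer",
    "moderna",
    "johnson & johnson",
)


def is_industry_affiliation(affiliation: str) -> bool:
    """Return True if the affiliation text contains any industry keyword."""
    text = affiliation.lower()
    return any(keyword in text for keyword in INDUSTRY_KEYWORDS)


def industry_authors(authors: list[dict]) -> list[tuple[str, str]]:
    """Return (author name, first matching affiliation) pairs."""
    matches = []
    for author in authors:
        for affiliation in author["affiliations"]:
            if is_industry_affiliation(affiliation):
                matches.append((author["name"], affiliation))
                break  # one match per author is enough
    return matches
```

Substring matching is case-insensitive here, so "Biotechnology" and "Pharmaceuticals" both match. The same simplicity means an academic unit such as a "Department of Pharmaceutical Sciences" would also match, which is one reason a keyword list alone can misclassify affiliations.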

### 3.3 Exporting Data to CSV

- Implemented CLI functionality with options:
  - `-h` / `--help`: Display usage guide.
  - `-f` / `--file`: Specify the filename for saving results.
- Stored output in a **well-structured CSV file**.
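
The options above map naturally onto the standard-library `argparse` and `csv` modules. The sketch below is one possible arrangement, not the project's actual script: the positional `query` argument, the fallback to standard output when `--file` is omitted, and the function names are assumptions. The column names are taken from the example output in the Results section.

```python
import argparse
import csv
import sys

# Column names as shown in the report's example output.
FIELDNAMES = [
    "PubmedID",
    "Title",
    "Publication Date",
    "Non-Academic Authors",
    "Company Affiliations",
    "Corresponding Author Email",
]


def build_parser() -> argparse.ArgumentParser:
    """Create the CLI parser; argparse adds -h/--help automatically."""
    parser = argparse.ArgumentParser(
        prog="get-papers-list",
        description="Fetch PubMed papers with industry-affiliated authors.",
    )
    parser.add_argument("query", help="PubMed search query")  # assumed positional
    parser.add_argument("-f", "--file", help="CSV file to write (assumed: stdout if omitted)")
    return parser


def write_csv(rows: list[dict], stream) -> None:
    """Write rows to an open text stream with a header line."""
    writer = csv.DictWriter(stream, fieldnames=FIELDNAMES)
    writer.writeheader()
    writer.writerows(rows)


def main(argv=None) -> int:
    args = build_parser().parse_args(argv)
    rows: list[dict] = []  # placeholder: fetching and filtering would fill this
    if args.file:
        # newline="" lets the csv module control line endings.
        with open(args.file, "w", newline="", encoding="utf-8") as handle:
            write_csv(rows, handle)
    else:
        write_csv(rows, sys.stdout)
    return 0
```

For example, `main(["covid vaccine", "-f", "results.csv"])` would parse the query and write a header-only `results.csv`, since `rows` is left empty in this sketch.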

### 3.4 Project Structure & Packaging

- Organized the project directory:
  - `src/pubmed_fetcher/` - Core logic
  - `scripts/get_papers.py` - CLI script
  - `tests/` - Unit testing
- Used **Poetry** for dependency and package management.
- Configured an **executable CLI command** (`get-papers-list`).

### 3.5 Publishing to TestPyPI

- Configured **TestPyPI** as the repository.
- Published package using `poetry publish -r testpypi`.
- Resolved potential **naming conflicts** to ensure successful deployment.

## **4. Results**

The script successfully generated a **CSV file** containing research papers with
industry-affiliated authors. Example output:

| PubmedID | Title | Publication Date | Non-Academic Authors | Company Affiliations | Corresponding Author Email |
| -------- | ---------------------- | ---------------- | -------------------- | -------------------- | -------------------------- |
| 12345678 | COVID-19 Vaccine Study | 2023-08-15 | John Doe | Pfizer | [email protected] |
| 87654321 | mRNA Vaccine Research | 2022-11-20 | Jane Smith | Moderna | [email protected] |

### Key Findings:

- Successfully retrieved **relevant research papers** based on search queries.
- Correctly identified **pharmaceutical and biotech affiliations**.
- Extracted **author details & contact emails** for further study.

## **5. Conclusion**

This project efficiently automates the retrieval of **industry-affiliated research
papers** from **PubMed**, presenting results in a structured CSV format. The
approach ensures a **scalable and reusable** solution for filtering research papers
by **company affiliations**.

### Future Enhancements:

- **Improved Affiliation Detection**: Use **AI-based entity recognition** for more
precise filtering.
- **Database Storage**: Implement a **structured database** for better querying.
- **Advanced Filtering Options**: Add filters for **date range, author names, and
specific companies**.

This research tool provides an **automated, scalable, and efficient** method for
identifying industry-affiliated research studies in PubMed.
