PubMed

Introduction:

Web scraping script was created to extract articles information from PubMed database https://www.ncbi.nlm.nih.gov/pubmed/.

Data is stored in MongoDB first then extracted to conduct data preprcoessing, manipulation and visualizaiton. More information could be found on http://woodenleaves.com/pages/pubmed.html

Python(Selenium, BeautifulSoup, Requests, Multiprocessing, Pandas, pymongo, re, bokeh, matplotlib)

MongoDB

ECharts.js

Data preprocessing, statistical analysis and data visualizaton

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
PubMed.ipynb		PubMed.ipynb
PubMed_Scraping.py		PubMed_Scraping.py
README.md		README.md
pubmed.png		pubmed.png
pubmed16.csv		pubmed16.csv