A web scraping script was created to extract article information from the PubMed database (https://www.ncbi.nlm.nih.gov/pubmed/).
The data is first stored in MongoDB, then extracted for data preprocessing, manipulation, and visualization. More information can be found at http://woodenleaves.com/pages/pubmed.html
Python (Selenium, BeautifulSoup, Requests, multiprocessing, pandas, pymongo, re, Bokeh, Matplotlib)
MongoDB
ECharts.js
Data preprocessing, statistical analysis, and data visualization
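
As a rough illustration of the preprocessing step, the sketch below cleans a few scraped records with `re` and counts articles per publication year. It is a minimal example under stated assumptions, not the project's actual code: the field names `title` and `pub_date`, the sample records, and the helper names `clean_record` and `articles_per_year` are all hypothetical. In the real pipeline the records would presumably be read back from MongoDB rather than defined inline.

```python
import re
from collections import Counter

# Hypothetical records, shaped like documents read back from MongoDB.
# The field names ("title", "pub_date") are assumptions, not the project's schema.
records = [
    {"title": "  Deep learning in  radiology.\n", "pub_date": "2018 Mar 12"},
    {"title": "CRISPR screening methods", "pub_date": "Epub 2017 Nov"},
    {"title": "Untitled", "pub_date": ""},
]

# Matches a four-digit year starting with 19 or 20.
YEAR_RE = re.compile(r"\b(?:19|20)\d{2}\b")


def clean_record(record):
    """Collapse whitespace in the title and pull a year out of the date string."""
    title = re.sub(r"\s+", " ", record.get("title", "")).strip()
    match = YEAR_RE.search(record.get("pub_date", ""))
    year = int(match.group()) if match else None
    return {"title": title, "year": year}


def articles_per_year(records):
    """Count cleaned records per year, skipping records with no parseable year."""
    cleaned = (clean_record(r) for r in records)
    return Counter(c["year"] for c in cleaned if c["year"] is not None)


counts = articles_per_year(records)
```

A count like this could then be passed to a plotting library such as Matplotlib or Bokeh to chart publication volume over time.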
