0% found this document useful (0 votes)
72 views

Jia Li

Jia Li is a PhD student at the Institute of Computing Technology, Chinese Academy of Sciences with a GPA of 86.7/100. She has research experience in mining structured data from the web and Chinese word segmentation. She has implemented machine learning algorithms and designed a reusable task distribution framework. She has received several academic honors and scholarships for her work.

Uploaded by

vena900620
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
72 views

Jia Li

Jia Li is a PhD student at the Institute of Computing Technology, Chinese Academy of Sciences with a GPA of 86.7/100. She has research experience in mining structured data from the web and Chinese word segmentation. She has implemented machine learning algorithms and designed a reusable task distribution framework. She has received several academic honors and scholarships for her work.

Uploaded by

vena900620
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Jia Li

[email protected] +86-13810113047 Institute of Computing Technology Chinese Academy of Sciences Beijing, China, 100190

EDUCATION Sep 2011 - Present Institute of Computing Technology, Chinese Academy Sciences GPA: 86.7/100 Sep 2007 - July 2011 B.Eng. Software Engineering, Dalian University of Technology GPA: 86.8/100, Rank: 1st/28 (class), 13th/397 (grade) ACADEMIC ACTIVITIES Aug 2012 - Sep 2012 Machine Learning Summer School (with scholarship) at Kyoto, Japan Dec 2011 - Apr 2012 Intern at WSM Group, Microsoft Research Asia Oct 2010 - May 2011 Intern at Institute of Computing Technology, Chinese Academy of Sciences RESEARCH AND PROJECT EXPERIENCES Mining Field Related Data from the Web (paper to be submitted) Dec 2011 - Apr 2012 Research Intern at Microsoft Research Asia, Mentor: Yunbo Cao Developed a prototype system which is capable of discovering and extracting eld-related structured data from the Web. There are two major components in this system. The rst one is mining related web pages by leveraging query log. The second one is extracting structured data from the web pages. I proposed a new bi-tag path method to extract the structural entity names as well as entity attributes from these pages. At the same time, semantic labels can be assigned to entity attributes when they are extracted. The system is going to become a part of Microsoft product system. Moreover,I wrote a paper to illustrate the methods and the results. Chinese Words Segmentation May 2012 - June 2012 at Institute of Computing Technology, Chinese Academy of Sciences Implemented several classic algorithms including Forward Maximum Match, Backward Maximum Match, and Shortest Path Algorithm. Proposed a new method: merging single words with small probabilities which solves the problem of undened words. Result: stable improvement (about 2-percentage) of F1-score for all the three classic methods. Implementation of Classic Algorithms in Machine Learning Oct 2011 - Nov 2011 at Institute of Computing Technology, Chinese Academy of Sciences Implemented classic algorithm including Na Bayes, Back Propagation Neural Network, ve Perception Approach, K-Means. Reusable Task Distribution Framework Nov 2010 - May 2011 at Institute of Computing Technology, Chinese Academy of Sciences Designed and implemented a reusable framework for task dispatch without knowing any knowledge about the code inside. There are three kinds of nodes: master, client, and result. The master node is in charge of the allocation of the tasks to the client and send the result to

the result node. The client nodes are responsible for the task execution and send the result back to the master. In the result node, users can deal with the results. In order to guarantee the security of data, we used shared memory to solve when sharing data between dierent nodes so that data will never be written to the disk. Automatic Paper Format Check System Nov 2009 - Apr 2010 at Dalian University of Technology, advisor: Guohai Jiang There are certain paper formats required by every university. It requires a lot of eort to check the format by human. So we wanted to build a system which is capable to check the format automatically by examining the XML le of word. I nished the part of checking the format of cover, footer, header, page layout as well as getting the format of footer automatically. SCHOLARSHIPS AND HONORS Competition Awards 2011 Honorable Mention in the 36th ACM/ICPC Asia Regional contest 2010 First Prize in ACM/ICPC China Northeast Area Programming Contest 2010 First Prize in ACM/ICPC Liaoning Province Programming Contest 2010 Second Prize in ACM/ICPC Dalian University of Technology Programming Contest 2010 First Prize in ACM/ICPC College Programming Contest 2010 Third Prize in National English Contest Scholarships & Honors 2010 National Scholarship (less than 1 % undergraduate students) 2012 Scholarship from Machine Learning Summer School 2009 - 2010 First-class Learning Scholarship (twice) 2008 Second-class Learning Scholarship of Dalian University of Technology 2009 - 2010 Spiritual Scholarship (twice) 2010 Scientic and Technical Innovation Scholarship 2010 Style and Activity Scholarship 2009 Second-class Mitsubishi Chemical Scholarship 2008 Eastern Industrial Scholarship 2010 Outstanding Graduates of Dalian 2009 - 2010 School Top Three Student (twice) RESEARCH INTEREST AND RELATED COURSES Social Network, Machine Learning, Data Mining and Information Retrieval. Data Mining 86 Pattern Recognition 91 Nature Language Processing 92 Information Retrieval 88 Computational Linguistics 93 Social Network Mining and Applications 90

PROGRAMMING SKILLS Qualied in C, C++, C#, have certain knowledge of Java and network programming, procient in XML, SqlServer. ENGLISH SKILLS GRE: Verbal 159, Quantitative 170, AW 3.5 TOEFL: 109 (R 28, L 29, S 22, W 30) Third Price in National English Contest, Passed the BEC Vantage Test

You might also like