Introduction Advanced DB
Introduction Advanced DB
Jaime Arguello
[email protected]
January 8, 2014
10
11
http://www.dmoz.org 12
http://www.dmoz.org 13
14
http://www.yelp.com/biz/cosmic-cantina-chapel-hill
(not actual page)
15
16
17
18
19
20
query
corpus
results
web pages
21
query
corpus
results
scientific
publications
22
query
corpus
results
news articles
23
query
corpus
results
curated/synthesized
business listings
24
query
corpus
results
files in my laptop
25
query
corpus
results
tweets
26
query
corpus
results
profiles
27
29
30
32
(AOL query-log) 33
34
• Description:
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
????????????????????????????????????????????????????????????????
???????????????
37
38
39
40
42
...
43
...
44
...
45
...
46
...
47
• Lots of in-links
(endorsements)
• Non-spam properties:
‣ grammatical sentences
‣ no profanity
• Has good formatting
...
48
• Peer-reviewed by many
• Reading-level appropriate
for user community
• Has pictures
• Normal length
...
49
50
A B
53
A B
A B
55
A B
A B
57
A B
A B
59
A B
A B
61
A B
63
January 8, 2014
67
69
‣ test-collection construction
‣ evaluation metrics
‣ experimentation
‣ user studies
‣ search-log analysis
• Studies of search behavior
• Federated Search
• Clustering
• Text Classification
71
‣ 10% each
• 15% midterm
‣ 5% proposal
‣ 10% presentation
‣ 15% paper
72
• H: 95-100%
• P: 80-94%
• L: 60-79%
• F: 0-59%
73
• A: 94-96% • D: 64-66%
• B: 84-86%
• B-: 80-83%
• C+: 77-79%
• C: 74-76%
• C-: 70-73%
74
75
• Form groups of 2 or 3
• Make a presentation
• Book search
• Faceted search
• Federated search 77
• Be thorough
• Be scientific
78
• Do other readings
80