PPDM 2 Definition
● Sources: business, science, medicine, economics, geography, environment, sports, …
● Potentially valuable resource
● Raw data is useless: need techniques to automatically extract information from it
◆ Data: recorded facts
◆ Information: patterns underlying the data
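The distinction between data and information can be sketched in a few lines of Python. This is only an illustration: the records, the attribute names (`outlook`, `play`), and the helper `majority_rule` are hypothetical and not part of the slides.

```python
from collections import Counter, defaultdict

# Data: recorded facts. These toy records are invented for illustration only.
records = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "sunny", "play": "no"},
    {"outlook": "overcast", "play": "yes"},
    {"outlook": "rainy", "play": "yes"},
    {"outlook": "rainy", "play": "yes"},
    {"outlook": "rainy", "play": "no"},
]

def majority_rule(rows, attribute, target):
    """For each value of `attribute`, return the most frequent `target` value."""
    counts = defaultdict(Counter)
    for row in rows:
        counts[row[attribute]][row[target]] += 1
    return {value: c.most_common(1)[0][0] for value, c in counts.items()}

# Information: a pattern underlying the data, here a simple one-attribute rule.
rule = majority_rule(records, "outlook", "play")
for value, outcome in rule.items():
    print(f"if outlook = {value} then play = {outcome}")
```

The rule is a compact summary of the records, not a guarantee: the "rainy" rule already disagrees with one of the three rainy records.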
Data mining
● Extracting
◆ implicit,
◆ previously unknown,
◆ potentially useful
information from data
● Needed: programs that detect patterns and regularities in the data
● Strong patterns => good predictions
◆ Problem 1: most patterns are not interesting
◆ Problem 2: patterns may be inexact (or spurious)
… … … … …
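Problem 2 can be illustrated with a small simulation, given here as a sketch under stated assumptions rather than anything taken from the slides. The function name `best_chance_agreement`, the sizes (20 rows, 1000 attributes), and the seed are all hypothetical choices. Every attribute is pure noise, yet the best of them appears to "predict" the target well.

```python
import random

def best_chance_agreement(n_rows=20, n_attributes=1000, seed=0):
    """Return the highest agreement between a random binary target and
    any of n_attributes random binary attributes (all pure noise)."""
    rng = random.Random(seed)
    target = [rng.randint(0, 1) for _ in range(n_rows)]
    best = 0.0
    for _ in range(n_attributes):
        attribute = [rng.randint(0, 1) for _ in range(n_rows)]
        # Fraction of rows where this noise attribute matches the target.
        agreement = sum(a == t for a, t in zip(attribute, target)) / n_rows
        best = max(best, agreement)
    return best

best = best_chance_agreement()
print(f"best agreement found by chance: {best:.2f}")
```

With few rows and many candidate attributes, some attribute will match the target far more often than 50% purely by chance. A pattern found this way is spurious and would not hold up on new data.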
Can machines really learn?
● Definitions of “learning” from dictionary:
◆ Difficult to measure:
   To get knowledge of by study, experience, or being taught
   To become aware by information or from observation
◆ Trivial for computers:
   To commit to memory
   To be informed of, ascertain; to receive instruction
Operational definition:
●
Applications
•The result of learning—or the learning method itself—is
deployed in practical applications
–Processing loan applications
–Screening images for oil slicks
–Electricity supply forecasting
–Diagnosis of machine faults
–Marketing and sales
–Separating crude oil and natural gas
–Reducing banding in rotogravure printing
–Finding appropriate technicians for telephone faults
–Scientific applications: biology, astronomy, chemistry
–Automatic selection of TV programs
–Monitoring intensive care patients
from it?
● Caveats (warnings) must be attached to results
● Purely statistical arguments are never sufficient!
● Are resources put to good use?