This document discusses a semantic application for digital repositories that enables linking and relationships between research outputs such as papers, videos, presentations, and data. It describes a research output repository platform built on SQL Server 2008 and Entity Framework that utilizes semantic technologies like ontologies to capture relationships. The goal is to create an ecosystem where researchers can manage their work and it can be easily shared, harvested, and discovered. The project is currently in a technical preview stage with a public beta planned for later in 2008.


Semantic Application for Digital Repositories

Fabrizio Gagliardi
EMEA & LATAM Director
Technical Computing
MSR External Research
Microsoft Corporation
Microsoft Research’s Commitment to Science

Putting computing into science…

Applying Microsoft products and research technologies to advance
the scientific research and engineering innovation process

Putting science into computing…

Ensuring that research community requirements are factored into
future versions of Microsoft software

• Advancement of Science
• Global Collaboration
• Technology Excellence
• Interoperability
myGrid

• Semantic relationships between different data

• Semantic descriptions of services
• Annotations
• Provenance
• Repositories
• Ontologies
Research Output Repository Platform

Goals
• A platform for building services and tools for research output repositories
• Papers, Videos, Presentations, Lectures, References, Data, Code, etc.
• Relationships between stored entities
• Enable a tools and services ecosystem for “research output” repositories on MS technologies
• Utilizing OAI-ORE, SWORD, and other community protocols
• In development, deployment within MSR in early Q4
• Beta release to the community in late Q4
• Built on SQL Server 2008 + Entity Framework
• Using WPF and Silverlight for UI

[Diagram: the research output repository platform at the center, surrounded by UIs, Desktop, Search, Execution, Tools, Interop, and Syndication]
Research Output Repository Platform
Goals
• Create a platform for building “research output” repositories
• Engage with the digital library and scholarly communications community
• Become the “research output” repository for MSR (RMCr project)
– Papers, Videos, Presentations, Lectures, References, Data, Code, etc.
• Support an ecosystem of services and tools
• Available to the community for free (we are still considering the open source route)
• Build an easy-to-install collection of basic services and tools

Non-goals
• A generic platform for asset management
• Support the lifecycle of publications
• Compete with existing repository solutions

[Architecture diagram, top to bottom: Services/tools; Microsoft.Famulus.Framework; Microsoft.Famulus.Core (based on the Entity Framework model + extensions); SQL Server 2008, MS data storage technologies, Entity Framework runtime]
Research Output Repository Platform

• A Semantic Computing platform

• A hybrid between a relational database and a triple store

Triple stores
-Evolution friendly
-Poor performance
-No need to model everything in advance
-Semantic interpretation at the application level

Relational schema
-Evolution not so easy
-Great opportunities for optimization
-Model everything in advance

Research Output Repository Platform
-Maintain a balance
-Try to model the frequently used entities in our app domain
-Try to capture the frequently used relationships
-Allow for extensibility (Relationships, Attributes)
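
The balance described above can be pictured with a minimal sketch. It is not the platform's actual schema or API: the names `Resource`, `Publication`, `ExtraAttributes`, `RelationshipStore`, and the `derivedFrom` predicate are all hypothetical, and the DOI value is a placeholder. The idea is that frequently used attributes are modeled as typed properties (the relational side), while rarely used attributes and relationships go into open-ended collections (the triple-store side).

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Usage (top-level statements, C# 9 or later).
var paper = new Publication { Title = "Title1" };   // modeled, typed attribute
var dataset = new Resource();

// An attribute nobody modeled in advance goes into the extension bag.
paper.ExtraAttributes["doi"] = "10.0000/placeholder";

// A relationship nobody modeled in advance is stored as a triple.
var store = new RelationshipStore();
store.Add(paper, "derivedFrom", dataset);

foreach (var target in store.TargetsOf(paper, "derivedFrom"))
    Console.WriteLine($"{paper.Title} derivedFrom {target.Id}");

// Hypothetical base type: every stored entity has an identity and an
// open-ended attribute bag for extensibility.
public class Resource
{
    public Guid Id { get; } = Guid.NewGuid();
    public Dictionary<string, string> ExtraAttributes { get; } =
        new Dictionary<string, string>();
}

// A frequently used entity gets its own type and typed properties.
public class Publication : Resource
{
    public string Title { get; set; } = "";
}

// A tiny in-memory triple list: (subject, predicate, target).
public class RelationshipStore
{
    private readonly List<(Resource Subject, string Predicate, Resource Target)> _triples =
        new List<(Resource Subject, string Predicate, Resource Target)>();

    public void Add(Resource subject, string predicate, Resource target) =>
        _triples.Add((subject, predicate, target));

    public IEnumerable<Resource> TargetsOf(Resource subject, string predicate) =>
        _triples.Where(t => ReferenceEquals(t.Subject, subject) && t.Predicate == predicate)
                .Select(t => t.Target);
}
```

In this sketch the typed `Title` property is what a relational schema could index and optimize, while the string-keyed collections can grow without a schema change, at the cost of interpreting them at the application level.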
An intuitive programming experience

Person tony = new Person();

Publication pub1 = new Publication();
pub1.Title = "Title1";
Publication pub2 = new Publication();
pub2.Title = "Title2";

pub1.Cites.Add(pub2);
pub1.Authors.Add(tony);

Tag tag = new Tag();

tag.Name = "keyword";
pub1.Tags.Add(tag);
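
For the snippet above to compile, the entity types need to expose their relationships as ordinary collections. The following is a minimal, self-contained sketch of what such types could look like. These classes are hypothetical stand-ins written for illustration, not the platform's real entity classes, and the `Name` property on `Person` is an assumption that does not appear on the slide.

```csharp
using System;
using System.Collections.Generic;

// Usage mirroring the slide (top-level statements, C# 9 or later).
var tony = new Person { Name = "Tony" };
var pub1 = new Publication { Title = "Title1" };
var pub2 = new Publication { Title = "Title2" };

pub1.Cites.Add(pub2);
pub1.Authors.Add(tony);
pub1.Tags.Add(new Tag { Name = "keyword" });

Console.WriteLine($"{pub1.Title} cites {pub1.Cites.Count} publication(s)");

// Hypothetical stand-in for a person entity.
public class Person
{
    public string Name { get; set; } = "";
}

// Hypothetical stand-in for a tag entity.
public class Tag
{
    public string Name { get; set; } = "";
}

// Hypothetical stand-in for a publication entity.
public class Publication
{
    public string Title { get; set; } = "";

    // Each relationship is a plain collection, so adding a citation,
    // an author, or a tag is a single Add call.
    public List<Publication> Cites { get; } = new List<Publication>();
    public List<Person> Authors { get; } = new List<Person>();
    public List<Tag> Tags { get; } = new List<Tag>();
}
```

In a real Entity Framework model these collections would typically be navigation properties backed by the database. Here they are in-memory lists, so nothing is persisted.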
Research Output Repository Platform

[Diagram: an entity-relationship graph. Nodes: Lecture on 2/19/2008, PDF file, PowerPoint presentation, tony, and a group of people (Elizabeth, Sebastien, Matthew, Norman, Brian, Sarah, George, Roy). Edge labels: is representation of, contains, authored by, organized by, presented by.]
An Ecosystem of Research Repositories
Support of harvesting & federation
to/from Institutional Repositories
- arXiv.org
- DSpace
- ePrints
- Fedora
- etc.

Entities + Relationships can be synched to cloud storage so that they are:
- Always Available
- Sharable
- Mixable
- Harvestable

Researchers manage their personal research entities (data, citations, documents, workflows, etc.)
Current Project Status

• Limited Tech Preview release due June 2008

• Public Beta targeted for Aug/Sept 2008

For more details

– Contact:
• Alex Wade (Program Manager) / [email protected]
– Community Forum:
• http://community.research.microsoft.com/forums/90.aspx
eScience and Semantic Computing
meet the Cloud
The cyberinfrastructure for the next
generation of researchers
The Future: Software plus Services for Science?

• Expect scientific research environments will follow similar trends to the commercial sector
– Leverage computing and data storage in the cloud
– Scientists already experimenting with Amazon S3 and EC2
services, with mixed results;
• For many of the same reasons
– Siloed research teams, no resource sharing across labs
– High storage costs
– Low resource utilization
– Excess capacity
– High costs of reliably keeping machines up-to-date
– Little support for developers, system operators

A smart cyberinfrastructure

• Collective intelligence
– If last.fm can recommend what song to broadcast to me
based on what my friends are listening to, why cannot the
cyberinfrastructure of the future recommend articles of
potential interest based on what the experts in the field
that I respect are reading?
– Already examples emerging but the process is manual
(Connotea, BioMedCentral Faculty of 1000 ...)
• Automatic correlation of scientific data
• Smart composition of services and functionality
• Cloud computing to aggregate, process, analyze and
visualize data
A world where all data is linked…
• Data/information is interconnected through machine-interpretable information (e.g. paper X is about star Y)
• Social networks are a special case
of ‘data networks’

• Important/key considerations
– Formats or “well-known” representations
of data/information
– Pervasive access protocols are key (e.g.
HTTP)
– Data/information is uniquely identified
(e.g. URIs)
– Links/associations between
data/information

Attribution: Richard Cyganiak
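
The considerations above (unique identifiers, links between pieces of data) can be illustrated with a small in-memory set of subject/predicate/object triples. This is only a sketch: every URI below is a made-up placeholder on example.org, the `isAbout` vocabulary term is hypothetical, and a real deployment would use an RDF library and published vocabularies instead of a plain list.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Every resource and every link type is identified by a URI.
// All URIs here are illustrative placeholders.
var paperX = new Uri("http://example.org/papers/X");
var starY = new Uri("http://example.org/stars/Y");
var isAbout = new Uri("http://example.org/vocab/isAbout");

// A link is a (subject, predicate, object) triple; "Obj" is the object.
var graph = new List<(Uri Subject, Uri Predicate, Uri Obj)>
{
    (paperX, isAbout, starY),
};

// Follow the links: which resources are about star Y?
var aboutStarY = graph
    .Where(t => t.Predicate == isAbout && t.Obj == starY)
    .Select(t => t.Subject)
    .ToList();

foreach (var subject in aboutStarY)
    Console.WriteLine(subject);
```

Because the statement "paper X is about star Y" is itself data, any program that understands the `isAbout` link can answer the query without knowing anything else about papers or stars.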

…and stored/processed/analyzed in the
cloud
Vision of Future Research Environment with both Software + Services

[Diagram: services surrounding the researcher. Labels: visualization and analysis services; scholarly communications; domain-specific services; search; blogs & social networking; books; citations; reference management; instant messaging; project management; identity; mail; notification; document store; storage/data services; knowledge management; compute services; knowledge discovery; virtualization]
Added slides
eScience
Emergence of a New Research Paradigm?
• Thousand years ago – Experimental Science
– Description of natural phenomena
• Last few hundred years – Theoretical Science
– Newton’s Laws, Maxwell’s Equations…
• Last few decades – Computational Science
– Simulation of complex phenomena
• Today – eScience or Data-centric Science
– Unify theory, experiment, and simulation
– Using data exploration and data mining
• Data captured by instruments
• Data generated by simulations
• Data generated by sensor networks
– Scientists overwhelmed with data
– Computer Science and IT companies have technologies that will help

[Slide illustration: $\left(\frac{\dot{a}}{a}\right)^2 = \frac{4\pi G\rho}{3} - K\frac{c^2}{a^2}$]

(With thanks to Jim Gray)

Today

Web users...
• Generate content on the Web
– Blogs, wikis, podcasts, videocasts, etc.
• Form communities
– Social networks, virtual worlds
• Interact, collaborate, share
– Instant messaging, web forums, content sites
• Consume information and services
– Search, annotate, syndicate

Scientists...
• Annotate, share, discover data
– Custom, standalone tools
• Conferences, Journals
– Publication process is long, subscriptions, discoverability issues
• Collaborate on projects, exchange ideas
– Email, F2F meetings, video-conferences
• Use workflow tools to compose services
– Domain-specific services/tools
Data can be easily produced

http://ecrystals.chem.soton.ac.uk
Thanks to Jeremy Frey
Data and services can be easily composed

Taverna Workflow
Compose services from the Web

SensorMap
Functionality: Map navigation
Data: sensor-generated temperature, video camera feed,
traffic feeds, etc.
Data is easily accessible

With thanks to
Catharine van Ingen
Data is easily shareable

Sloan Digital Sky Server/SkyServer

http://cas.sdss.org/dr5/en/
Today…

Computers are great tools for storing, computing, managing, and indexing huge amounts of data

For example, Google and Microsoft both have copies of the Web
for indexing purposes
Tomorrow…

Computers will still be great tools for storing, computing, managing, and indexing huge amounts of data

We would like computers to also help with the automatic acquisition, discovery, aggregation, organization, correlation, analysis, interpretation, and inference of the world’s information
Semantic Computing
What is Semantic Computing?

• Set of concepts and technologies

– Data modeling
– Relationships
– Ontologies
– Machine learning (entity extraction)
– Inference, reasoning
– Data, information, knowledge…

[Diagram: a progression Data → Information → Knowledge → Intelligence → Wisdom, annotated with the labels “Current technologies” and “Possibilities for innovation”]

Semantics

• Term used to refer to the concept of “meaning”

• The linguistics, AI, Natural Language Processing, etc. communities have been working on “meaning” and “knowledge” related technologies for decades
• Pragmatic approach to Semantic Computing
– Emergence of a new breed of technologies to capture
meaning (RDF, OWL, etc.)
– Combine with the pervasiveness of the Web
community technologies such as folksonomies …
A word about the “Semantic Web”
• The term is used to describe a set of technologies used to represent data, concepts, and their relationships
– Has become a buzzword, like Web 2.0
• Prefer to use the term “Semantic Computing”, which is about modeling data in ways that can be automatically processed by computers
Semantic Computing

• Some efforts are driven by the traditional “knowledge engineering” community
– Engaged in building well-controlled ontologies
– Important for domain-specific vocabularies with data
formats and relationships specific to a community
– Model does not easily scale to the Internet
• Some efforts are driven by the Web 2.0 community
– Focus on the pervasiveness of Web protocols/standards
– Emphasis on microformats (small, flexible, embeddable
structures)
– Exploit evolving and ever-expanding vocabularies such as
folksonomies and tag clouds
