Big Semantic Data Processing in The Materials Design Domain: Definitions
Big Semantic Data Processing in The Materials Design Domain: Definitions
Big Semantic Data Processing in The Materials Design Domain: Definitions
Design Domain
1
2 Lambrix, Armiento, Delin, Li
terials and for semantic query support ber of databases for phase identification
(e.g., Zhang et al (2015b, 2017)). are hosted. These databases have been
Further, standards for exporting in use by experimentalists for a long
data from databases and between tools time.
are being developed. These standards Springer Materials (http:
provide a way to exchange data be- //materials.springer.com/)
tween databases and tools, even if the contains among many other data sources
internal representations of the data in the well-known Landolt Bornstein
the databases and tools are different. database, an extensive data collection
They are a prerequisite for efficient from many areas of physical sciences
materials data infrastructures that allow and engineering. Similarly, The Japan
for the discovery of new materials National Institute of Material Science
(Austin (2016)). In several cases the (NIMS) Materials Database MatNavi
standards formalize the description of (http://mits.nims.go.jp/
materials knowledge (and thereby create index_en.html) contains a wide
ontological knowledge). collection of mostly experimental but
In the remainder of this section a brief also some computational electronic
overview of databases, ontologies and structure data.
standards in the field is given. Thermodynamical data, necessary
for computing phase diagrams with the
CALPHAD method, exist in many dif-
ferent databases (Campbell et al (2014)).
Databases Open access databases with relevant
data can be found through OpenCalphad
The Inorganic Crystal Struc- (http://www.opencalphad.
ture Database (ICSD, https: com/databases.html).
//icsd.fiz-karlsruhe.de/) Databases of results from electron
is a frequently utilized database for structure calculations have existed in
completely identified inorganic crystal some form for several decades. In 1978,
structures, with nearly 200k entries Moruzzi, Janak, and Williams published
(Belsky et al (2002); Bergerhoff et al a book with computed electronic prop-
(1983)). The data contained in ICSD erties such as, e.g., density of states,
serve as an important starting point bulk modulus and cohesive energy
in many electronic structure calcula- of all metals (Moruzzi et al (2013)).
tions. Several other crystallographic Only recently however, the use of such
information resources are also avail- databases have become widespread, and
able (Glasser (2016)). A popular open some of these databases have grown to a
access resource is the Crystallography substantial size.
Open Database (COD, http://www. Among the more recent efforts
crystallography.net/cod/) to collect materials properties ob-
with nearly 400k entries (Grazulis et al tained from electronic structure
(2012)). calculations publicly available a
At the International Cen- few prominent examples include the
tre for Diffraction Data (ICDD, Electronic Structure Project (ESP)
http://www.icdd.com/) a num- (http://materialsgenome.se)
Data Processing in Materials Design 5
be used during querying of databases CEN (2010) A guide to the development and
as well as in the process of connecting use of standards compliant data formats for
different resources. engineering materials test data European
Committee for standardization
Cheng X, Hu C, Li Y (2014) A semantic-
driven knowledge representation model for
the materials engineering application. Data
References Science Journal 13:26–44, DOI 10.2481/dsj.
13-061/
Agrawal A, Alok C (2016) Perspective: mate- Cheung K, Drennan J, Hunter J (2008) To-
rials informatics and big data: realization of wards an Ontology for Data-driven Discov-
the Fourth paradigm of science in materials ery of New Materials. In: McGuinness D,
science. APL Mater 4:053,208:1–10, DOI Fox P, Brodaric B (eds) Semantic Scientific
10.1063/1.4946894 Knowledge Integration AAAI/SSS Work-
Ashino T (2010) Materials Ontology: An In- shop, pp 9–14
frastructure for Exchanging Materials Infor- Curtarolo S, Setyawan W, Wang S, Xue J,
mation and Knowledge. Data Science Jour- Yang K, Taylor R, Nelson L, Hart G,
nal 9:54–61, DOI 10.2481/dsj.008-041 Sanvito S, Buongiorno-Nardelli M, Mingo
Austin T (2016) Towards a digital infrastruc- N, Levy O (2012) AFLOWLIB.ORG:
ture for engineering materials data. Mate- A distributed materials properties reposi-
rials Discovery 3:1–12, DOI 10.1016/j.md. tory from high-throughput ab initio cal-
2015.12.003 culations. Computational Materials Science
Belsky A, Hellenbrandt M, Karen VL, Luksch 58(Supplement C):227–235, DOI 10.1016/j.
P (2002) New developments in the Inor- commatsci.2012.02.002
ganic Crystal Structure Database (ICSD): Curtarolo S, Hart G, Buongiorno-Nardelli M,
accessibility in support of materials research Mingo N, Sanvito S, Levy O (2013)
and design. Acta Crystallographica Section The high-throughput highway to compu-
B: Structural Science 58(3):364–369, DOI tational materials design. Nature Materials
10.1107/S0108768102006948 12(3):191, DOI 10.1038/nmat3568
Bergerhoff G, Hundt R, Sievers R, Brown ID Euzenat J, Shvaiko P (2007) Ontology Match-
(1983) The inorganic crystal structure data ing. Springer
base. Journal of Chemical Information and Faber F, Lindmaa A, von Lilienfeld A,
Computer Sciences 23(2):66–69, DOI 10. Armiento R (2016) Machine Learn-
1021/ci00038a003 ing Energies of 2 Million Elpasolite
Bernstein HJ, Bollinger JC, Brown ID, Grazulis $(AB–C˝˙–2˝–D˝˙–6˝)$ Crystals. Phys-
S, Hester JR, McMahon B, Spadaccini N, ical Review Letters 117(13):135,502,
Westbrook JD, Westrip SP (2016) Specifica- DOI 10.1103/PhysRevLett.117.135502
tion of the crystallographic information file Frenkel M, Chiroco RD, Diky V, Dong
format, version 2.0. J Appl Cryst 49:277– Q, Marsh KN, Dymond JH, Wakeham
284, DOI 10.1107/S1600576715021871 WA, Stein SE, Knigsberger E, Goodwin
Bhat M, Shah S, Das P, Reddy S (2013) ARH (2006) XML-based IUPAC stan-
Premλ p: knowledge driven design of dard for experimental, predicted, and crit-
materials and engineering process. In: ically evaluated thermodynamic property
ICoRD’13, Springer, pp 1315–1329, data storage and capture (ThermoML)
DOI 10.1007/978-81-322-1050-4˙105 (IUPAC Recommendations 2006). Pure
Campbell CE, Kattner UR, Liu ZK (2014) Appl Chem 78:541612, DOI 10.1351/
File and data repositories for Next Genera- pac200678030541
tion CALPHAD. Scripta Materialia 70(Sup- Frenkel M, Chirico RD, Diky V, Brown
plement C):7–11, DOI 10.1016/j.scriptamat. PL, Dymond JH, Goldberg RN, Goodwin
2013.06.013 ARH, Heerklotz H, Knigsberger E, Lad-
Ceder G, Persson KA (2013) How Supercom- bury JE, Marsh KN, Remeta DP, Stein SE,
puters Will Yield a Golden Age of Materials Wakeham WA, Williams PA (2011) Ex-
Science. Scientific American 309 tension of ThermoML: The IUPAC stan-
dard for thermodynamic data communi-
10 Lambrix, Armiento, Delin, Li
informatics 3:44:1–44:15, DOI 10.1186/ tern and Its Use Case for Modeling Material
1758-2946-3-44 Transformation. Semantic Web 8:719–731,
Murray-Rust P, Townsend JA, Adams SE, DOI 10.3233/SW-160231
Phadungsukanan W, Thomas J (2011) The Zhang X, Hu C, Li H (2009) Semantic query on
semantics of Chemical Markup Language materials data based on mapping matml to
(CML): dictionaries and conventions. Jour- an owl ontology. Data Science Journal 8:1–
nal of Cheminformatics 3:43, DOI 10.1186/ 17, DOI 10.2481/dsj.8.1
1758-2946-3-43 Zhang X, Zhao C, Wang X (2015a) A sur-
Pizzi G, Cepellotti A, Sabatini R, Marzari N, vey on knowledge representation in materi-
Kozinsky B (2016) AiiDA: automated inter- als science and engineering: An ontological
active infrastructure and database for com- perspective. Computers in Industry 73:8–22,
putational science. Computational Materials DOI 10.1016/j.compind.2015.07.005
Science 111(Supplement C):218–230, DOI Zhang X, Pan D, Zhao C, Li K (2016) MMOY:
10.1016/j.commatsci.2015.09.013 Towards deriving a metallic materials on-
Premkumar V, Krishnamurty S, Wileden JC, tology from Yago. Advanced Engineering
Grosse IR (2014) A semantic knowledge Informatics 30:687–702, DOI 10.1016/j.aei.
management system for laminated com- 2016.09.002
posites. Advanced engineering informatics Zhang X, Chen H, Ruan Y, Pan D, Zhao C
28(1):91–101, DOI 10.1016/j.aei.2013.12. (2017) MATVIZ: a semantic query and vi-
004 sualization approach for metallic materials
Radinger A, Rodriguez-Castro B, Stolz A, data. International Journal of Web Infor-
Hepp M (2013) Baudataweb: the austrian mation Systems 13:260–280, DOI 10.1108/
building and construction materials mar- IJWIS-11-2016-0065
ket as linked data. In: Proceedings of the Zhang Y, Luo X, Zhao Y, chao Zhang
9th International Conference on Semantic H (2015b) An ontology-based knowledge
Systems, ACM, pp 25–32, DOI 10.1145/ framework for engineering material se-
2506182.2506186 lection. Advanced Engineering Informat-
Rajan K (2015) Materials Informatics: The ics 29:9851000, DOI 10.1016/j.aei.2015.09.
Materials Gene and Big Data. Annu Rev 002
Mater Res 45:153–169, DOI 10.1146/
annurev-matsci-070214-021132
Saal JE, Kirklin S, Aykol M, Meredig
B, Wolverton C (2013) Materials De-
sign and Discovery with High-Throughput
Density Functional Theory: The Open
Quantum Materials Database (OQMD).
JOM 65(11):1501–1509, DOI 10.1007/
s11837-013-0755-4
Soldatova LN, King RD (2006) An ontology
of scientific experiments. J R Soc Inter-
face 3(11):795–803, DOI 10.1098/rsif.2006.
0134
Swindells N (2009) The representation and ex-
change of material and other engineering
properties. Data Science Journal 8:190–200,
DOI 10.2481/dsj.008-007
van der Vet P, Speel PH, Mars N (1994) The
Plinius ontology of ceramic materials. In:
Mars N (ed) Workshop Notes ECAI’94
Workshop Comparison of Implemented On-
tologies, pp 187–205
Vardeman C, Krisnadhi A, Cheatham M,
Janowicz K, Ferguson H, Hitzler P, Buc-
cellato A (2017) An Ontology Design Pat-