Automatic Ontology Creation by Extracting Metadata From The Source Code
Automatic Ontology Creation by Extracting Metadata From The Source Code
Automatic Ontology Creation by Extracting Metadata From The Source Code
0) November 2010
I.
INTRODUCTION
RELATED WORK
Metadata
Ontology
EXTRACTION FRAMEWORK
After the completion of a project all the project files are sent
to the metadata extraction frame work. The files of a project
can be asked to give in form of zip file which contains
various types of files. The framework will extract files and
give it to corresponding Document Generator tools and it
will convert the file metadata into XML file. This paper
deals with java files only. To extract file content metadata
the source code is processed by Javadoc. The retrieved
components will be stored in the HBase, which is the
subproject of Hadoop. The project may contain files in
various languages and tools. Similarly there are many parser
tools available to parse the source code from various
languages and tools to either HTML or XML output. So the
frame work will indentify the type of project file and supply
the file to the corresponding parser. Two types of metadata
will be extracted. First is the file metadata and it contains
file attributes, file size, parent, path, etc. The second type of
metadata contains the contents of the source file. The file
content metadata will be different for different types of
languages and tools. Here the frame work presents the
general form for all type of project files. The frame work
output shown is for the extracted metadata into XML file.
The available Document Generator converting programs
from one language or tool to HTML or XML file is
presented in Table 1. But the frame work takes care of the
languages and tools that support XML format. The
framework needs only few tools which will be available in
the HDFS to support to handle various types of files. After
the converting the source files in to XML, files they are
stored in HBase along with the Project files. The purpose of
storing the XML files, source file and file attributes and file
components in Hbase is to retrieve the code components and
to reduce the software development cost. The components
will be stored in Hbase by creating database design with the
fields project name, project leader, period of project and
code components. Several Sample Document Generator
alone
discussed
and
used
in
this
framework.
CASE STUDY
<compoundname>Calculator</compoundname
>
<sectiondef kind="public-func">
<memberdef kind="function"
<type>int</type>
<definition>int
Calculator::add</definition>
<argsstring>(int i1, int
i2)</argsstring>
<name>add</name>
<param>
<type>int</type>
<declname>i1</declname>
</param>
<param>
<type>int</type>
<declname>i2</declname>
</param>
<location
</memberdef>
<memberdef kind="function"
<type>int</type>
<definition>int
Calculator::subtract</definition>
<argsstring>(int i1, int
i2)</argsstring>
<name>subtract</name>
<param>
<type>int</type>
<declname>i1</declname>
</param>
<param>
<type>int</type>
<declname>i2</declname>
</param>
<location
</memberdef>
</sectiondef>
<listofallmembers>
<scope>Calculator</scope><name>add</na
me></member>
<member
<scope>Calculator</scope><name>subtrac
t</name></member>
</listofallmembers>
</compounddef>
</doxygen>
CONCLUSION
REFERENCES
20021112/, 2002.
20) James Clark. XSL Transformations (XSLT).
Technical
report,
W3C,
http://www.w3.org/TR/xslt, 1999.
21) Stefan Mintert. Schluesselqualifikation; XML
jenseits des Mainstreams. iX, 8:4851, 2005.
22) Charlie Halpern-Hamu. Transform a sample
instance
to
a
schema.
http://incrementaldevelopment.com/papers/xsltrick/
,1999.
23) XSLT Quickly, Bob DuCharme, Manning
Publications, ISBN 1- 930110-11-1.
24) Mastering XML Transformations, Doug Tidwell,
OREILLY, ISBN 0596000537.