International Journal of Electrical and Computer Engineering (IJECE)
Vol. 5, No. 5, October 2015, pp. 1174~1179
ISSN: 2088-8708
Journal homepage: http://iaesjournal.com/online/index.php/IJECE
XML and Semantics
Mohammad Moradi*, Mohammad Reza Keyvanpour**
* Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran
** Department of Computer Engineering, Alzahra University, Tehran, Iran
Article Info
Article history:
Received May 8, 2015
Revised Jul 5, 2015
Accepted Jul 26, 2015

ABSTRACT
Since its introduction, eXtensible Markup Language (XML) has, owing to its expressive capabilities and flexibility, become the de facto standard for representing, storing, and interchanging data on the Web. These features have made XML one of the building blocks of the Semantic Web. From another viewpoint, since XML documents can be considered from content, structural, and semantic aspects, leveraging their semantics is useful and applicable in different domains. However, XML does not by itself introduce any built-in mechanisms for governing semantics. For this reason, many studies have been conducted on the representation of semantics within/from XML documents. This paper studies and discusses different aspects of this topic in the form of an overview, with an emphasis on the state of semantics in XML and its presentation methods.
Keyword: Metadata, Semantic annotation, Semantic Web, XML, XML semantics
Copyright © 2015 Institute of Advanced Engineering and Science. All rights reserved.
Corresponding Author:
Mohammad Moradi,
Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University,
Qazvin Islamic Azad University - Nokhbegan Blvd., Qazvin, Iran,
Email: Mhd.moradi@qiau.ac.ir
1. INTRODUCTION
Since its invention, and owing to its practical features, XML [1] has become very popular in various applications across different domains. In fact, because of its flexibility and extensibility to different domains, it is known as the de facto standard for publishing, storing, and exchanging data among (heterogeneous) systems and platforms, specifically the Web. These features have made XML one of the building blocks of the Semantic Web [2] and quickly led to its incorporation into different applications, including semantic data integration, modeling, and the creation of markup/description languages.
On the other hand, although XML allows users (programmers) to define their own tags and structures, it does not introduce any intrinsic, standard mechanism for representing semantics [3]. In this regard, and because of the valuable applications of XML semantics, researchers have proposed several methods to leverage such semantics with respect to different features and capabilities of XML.
In this paper, this topic is studied and discussed from different aspects, with a special emphasis on the state of semantics in XML and on its applications and presentation methods. The structure of the paper is as follows: Section 2 provides a brief introduction to XML. In Section 3, the position of XML in the Semantic Web is discussed. In Section 4, the relation between XML and semantics is studied. The notion of XML semantics and its different aspects and types are discussed in Section 5. Sections 6 and 7 discuss some important considerations regarding XML semantics and future works, respectively.
2. A GLIMPSE ON XML
XML, as a subset of Standard Generalized Markup Language (SGML), was developed by the World Wide Web Consortium (W3C) in 1996 to become a cross-platform, human- and machine-readable, and easy-to-create and easy-to-publish standard [1]. Unlike HTML, which is suited to presenting data (and information), XML deals with data directly in a non-presentational manner, specifically for storing and interchanging data.
Probably one of the most interesting and practical features of XML is that users (document creators) can create custom tags. This characteristic greatly increases the flexibility of XML documents, since users are not limited to a set of predefined (and probably meaningless) tags for building their documents. Thus, at least in contrast with HTML documents, XML documents are more readable by humans and machines. Nonetheless, XML only provides the means for defining grammars and structures; it does not introduce predefined rules and mechanisms to represent and interpret semantics within documents.
3. ROLE OF XML IN SEMANTIC WEB
Bringing semantics into the Web to shape the Semantic Web [2] was a milestone in the life of the Web. In fact, adding semantics to the content (and possibly the structure) takes it to a new level of understandability by both humans and machines through the incorporation of meaning into content. Further, the Semantic Web enables machines to comprehend semantic documents and data [2]. In this regard, different data-centric applications, such as data mining [4] and recommender systems [5], may leverage such semantics.
One of the important issues in this field is the formalization of represented meanings (semantics) for the sake of standardization, domain-wide acceptability, and increased portability and reusability. As a solution, ontologies (documents or files that formally define the relations among terms [2]) have been proposed (and mostly utilized in the form of annotation) for different domains and application areas.
As mentioned earlier, XML is one of the major technologies for building the Semantic Web, and it aims to provide an easy-to-use syntax for web data. Although it is capable of encoding all kinds of data that are exchanged among systems [6], it offers no mechanisms to interpret data or represent meanings in a structured and formal way.
Therefore, besides the syntax and grammar (provided by XML), the Resource Description Framework (RDF) was introduced as a complementary technology for expressing meanings [7]. RDF provides a standard, formalized, and interoperable model to describe facts about web resources, which gives some interpretation to the data [6].
To explain the roles of XML and RDF and their relationship in the Semantic Web, it can simply be said that XML is responsible for syntax (data interchange), while RDF is thought of as a metadata data model.
4. XML AND SEMANTICS
In the absence of a widely accepted, general standard, XML's flexibility allows users to create their own tags freely and without any predefined limitation, in contrast to HTML. Although these tags are sometimes meaningful, there is no guarantee that they will be intelligible to those who have no knowledge of the domain, or to anyone other than the creators of the documents. Figure 1 illustrates some examples of such XML documents.
Hence, measured against Semantic Web principles, XML cannot be regarded as a semantic markup language in its current form; it merely provides the functionality to add meaning to content in an unstructured way. In other words, the ideal is to create documents that are readable (and recognizable) by both human users and machines according to the intention of their creators. Nonetheless, this ideal has not yet been fully realized.
Since arbitrary tags are allowed in XML, in most cases some types of implicit (but not formalized) semantics may be found within the documents. In fact, this is why XML is popularly regarded as a semantic markup language. Nonetheless, there are many application areas that benefit from the semantic aspect of XML, including XML mining [8, 9] and XML retrieval [10, 11].
Figure 1. Examples of XML documents with Fingilish (Persian words written in the Latin alphabet) and vague tags
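The situation Figure 1 depicts can be sketched in a few lines of Python. This is a minimal illustration, not taken from the paper: the two documents and their tag names (including the transliterated ones) are hypothetical, and only the standard-library parser `xml.etree.ElementTree` is assumed. Both documents are well-formed and describe the same record, yet a parser sees no relation between their vocabularies.

```python
import xml.etree.ElementTree as ET

# Two hypothetical documents describing the same book with different custom tags.
# The second uses transliterated tag names of the kind Figure 1 refers to.
DOC_A = "<book><title>XML Basics</title><price>20</price></book>"
DOC_B = "<ketab><onvan>XML Basics</onvan><gheymat>20</gheymat></ketab>"


def tag_names(xml_text):
    """Return the set of element names found in a well-formed XML string."""
    root = ET.fromstring(xml_text)
    return {element.tag for element in root.iter()}


tags_a = tag_names(DOC_A)
tags_b = tag_names(DOC_B)

# Both documents parse without error, but they share no tag names, so
# nothing at the syntax level tells a machine that they mean the same thing.
shared_tags = tags_a & tags_b
print(sorted(tags_a))
print(sorted(tags_b))
print(shared_tags)
```

The parser validates only well-formedness; any link between `title` and `onvan` would have to come from outside the documents, which is the gap the approaches in the next section try to close.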
5. XML SEMANTICS
There are several open issues regarding the quality of XML semantics. Some even regard XML only as a markup language, rather than a semantic one, that provides facilities to improve semantics through markup by allowing user-defined tags [12]. Since XML has no predefined application-level processing semantics, it formally governs only syntax, not semantics [13]. Nonetheless, different levels of semantics can be seen in XML documents and may be leveraged in different application domains; thus, the main task in dealing with XML semantics is how to control it.
Because the topic is controversial, researchers have, over the past decade, pursued studies in different directions to cope with these intrinsic issues. The main goal of such studies has been to propose methods for representing the semantics of XML documents in a formal and usable way, so as to avoid ambiguity and capture the implicit semantics within the documents. According to the literature, the major research directions on this topic can be classified as follows (Figure 2):
- Mapping (transforming) XML documents into RDF or OWL to represent their semantics (as in [14-17])
- Adding formal and organized semantics to the XML documents via semantic annotation or additional attributes and structures (as in [18-22])
- Extracting implicit semantics within the XML documents (as in [23-26])
These three main approaches are compared in Table 1.
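To make the first direction concrete, the following sketch maps a small XML document into RDF-style (subject, predicate, object) triples. It is a naive illustration under stated assumptions, not the method of any of the cited works [14-17]: the base IRI `http://example.org/`, the sample document, and the convention that each record carries an `id` attribute are all hypothetical. Only the `rdf:type` IRI is taken from the RDF vocabulary.

```python
import xml.etree.ElementTree as ET

# Hypothetical base IRI used to mint identifiers; for illustration only.
BASE = "http://example.org/"
# The standard rdf:type property IRI.
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical sample document; every record is assumed to have an `id`.
DOC = """<library>
  <book id="b1"><title>XML Basics</title><year>2015</year></book>
  <book id="b2"><title>RDF Primer</title><year>2014</year></book>
</library>"""


def xml_to_triples(xml_text, base=BASE):
    """Map each record under the root to (subject, predicate, object) triples.

    The record's tag becomes its type, and each child element becomes a
    property whose value is the child's text content.
    """
    root = ET.fromstring(xml_text)
    triples = []
    for record in root:
        subject = base + record.get("id")
        triples.append((subject, RDF_TYPE, base + record.tag))
        for field in record:
            value = (field.text or "").strip()
            triples.append((subject, base + field.tag, value))
    return triples


for triple in xml_to_triples(DOC):
    print(triple)
```

Even this toy mapping shows where the approach becomes fragile: it silently depends on the `id` convention and on tag names being meaningful, which is the kind of difficulty with vague or irregular documents that Table 1 lists as a drawback.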
Figure 2. Classification of major directions around XML semantics
Although all of the mentioned approaches share the problem of lacking a widely accepted standard, it can be argued that methods belonging to the class of adding semantics to XML documents have more advantages than the others. In fact, mapping and extraction of semantics have several essential issues that make them less effective and, in some cases, impractical. For example, such approaches face serious problems when dealing with vague, ill-formed, and invalid documents. These approaches are usually used for dealing with legacy documents or when there is no control over how documents are marked up (post-markup approaches).
Adding semantics, by contrast, is feasible and applicable when producing documents (a pre-markup approach), although in some cases such techniques may also be used to add semantics to existing documents. Within this class, semantic annotation has several advantages over adding semantic structures to documents: it preserves the natural structure of documents, produces less additional markup (pieces of text), and is easily expandable with minimal side effects on documents.
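As a rough illustration of why attribute-based annotation leaves the document structure intact, the sketch below attaches a concept identifier to existing elements as a namespaced attribute. The annotation namespace, the attribute name `concept`, the ontology IRIs, and the tag-to-concept table are all hypothetical choices made for this example; the paper does not prescribe them.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation namespace and ontology IRIs; for illustration only.
SEM_NS = "http://example.org/sem#"
CONCEPTS = {
    "onvan": "http://example.org/onto#Title",
    "gheymat": "http://example.org/onto#Price",
}

# Ask the serializer to use the prefix "sem" for the annotation namespace.
ET.register_namespace("sem", SEM_NS)


def annotate(xml_text, concepts=CONCEPTS):
    """Add a sem:concept attribute to every element whose tag has a known concept.

    Elements, their order, and their text are left untouched; only
    attributes are added.
    """
    root = ET.fromstring(xml_text)
    for element in root.iter():
        concept = concepts.get(element.tag)
        if concept is not None:
            element.set("{%s}concept" % SEM_NS, concept)
    return root


annotated = annotate("<ketab><onvan>XML Basics</onvan><gheymat>20</gheymat></ketab>")
print(ET.tostring(annotated, encoding="unicode"))
```

Because the annotation lives in its own namespace, consumers that do not understand it can ignore it, and new concepts can be added by extending the lookup table rather than restructuring the document.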
Table 1. Comparison of major directions around XML semantics
Approach | Benefits | Drawbacks | Implementation | Efficiency | Challenges
Mapping (Transforming) | Taking advantage of available semantic resources and facilities | Incomplete mapping; dealing with invalid and vague documents | Relative (depends on the case) | Medium | Precise mapping; choosing appropriate ontologies; semantic matching
Adding Semantics | Presenting controlled semantics; high level of precision | Producing additional markup; error prone | Easiest | Higher | Choosing an appropriate (efficient) technique; semantic comprehensiveness; standardization
Extraction of Semantics | Independence from external resources | Unguaranteed results; low level of precision | Hardest | Lower | Dealing with complex (nested), ambiguous, and ill-formed documents; dealing with multilingual documents
6. CONSIDERATIONS
Owing to the intrinsic features of XML, it can be said that almost every document contains some type of semantics, whether implicit or explicit. In this regard, taking a closer look at the different levels of semantics within XML documents is a prerequisite for any further action to leverage them. From a general perspective, XML semantics may appear at three levels:
1. Element
2. Document
3. Group of documents
An XML element (or, generically, a tag) may include attributes, text, or other elements. Accordingly, an element's tag and/or its attribute(s) can be used to express some type of semantics by choosing meaningful tag and attribute names. This type of semantic (meaning) expression is the simplest and most straightforward. Most XML documents contain meaningful elements/attributes that were created deliberately; the problem is that such instances are usually not formalized and, thus, are not recognizable and interpretable by machines.
Document-level semantics refers to the fact that, in some cases, some type of meaning may be inferred from a document when it is considered as a whole. In other words, analyzing all elements of a given document can reveal some facts about its theme or purpose that may be regarded as semantics, specifically structural semantics.
Extending the previously mentioned concepts across several related (and most likely homogeneous) documents may yield more meaningful information about both their content and structure. Such related documents usually share a similar schema.
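One simple way to see document-level and group-level structure in practice is to compare the sets of tag paths that documents contain. The sketch below is only an illustration of that idea, not a technique proposed in the paper: the sample documents are hypothetical, and the Jaccard index over root-to-element paths is a swapped-in similarity measure chosen for brevity.

```python
import xml.etree.ElementTree as ET


def path_signature(xml_text):
    """Return the set of root-to-element tag paths found in a document."""
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(element, prefix):
        path = prefix + "/" + element.tag
        paths.add(path)
        for child in element:
            walk(child, path)

    walk(root, "")
    return paths


def structural_similarity(doc_a, doc_b):
    """Jaccard index of two path signatures (1.0 means identical path sets)."""
    a, b = path_signature(doc_a), path_signature(doc_b)
    # The union is never empty because every document has a root element.
    return len(a & b) / len(a | b)


# Hypothetical documents: two orders that follow the same schema, and a note.
ORDER_1 = "<order><item><name>pen</name></item><total>3</total></order>"
ORDER_2 = ("<order><item><name>ink</name></item>"
           "<item><name>pad</name></item><total>9</total></order>")
NOTE = "<note><to>Ali</to><body>hi</body></note>"

print(structural_similarity(ORDER_1, ORDER_2))
print(structural_similarity(ORDER_1, NOTE))
```

Documents that follow the same schema produce the same path set regardless of how many times an element repeats, so a high score hints that they belong to one group, while the note shares no paths with the orders.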
7. FUTURE WORKS
The issues and approaches discussed above, concerning different aspects of semantics in XML documents, reveal that, despite XML's applications in a broad range of domains, there is a vast gap between its potential and actual capabilities and opportunities in the representation of semantics. In fact, currently, the presentational (syntactical) aspect of XML is mostly used to freely express content and structure for different purposes. In this regard, as future work, we will propose a new semantic annotation method for XML documents that leverages an intrinsic feature of XML, namely attributes. We will then use this sort of semantics for mining XML documents.
8. CONCLUSION
Owing to its high degree of flexibility and extensibility, XML became, shortly after it was proposed, a popular means to store, represent, and interchange data between systems. Such expressive power also makes XML one of the building blocks of the Semantic Web. On the other hand, contrary to popular belief, XML by itself does not provide any mechanisms for governing semantics. Thus, there are several works on different aspects of this topic that provide solutions for dealing with XML semantics. In this paper, these issues and the current state of semantics in XML documents have been studied in the form of a concise overview. This work can be considered a report on the current state of the topic for further studies.
REFERENCES
[1] T. Bray, et al., “Extensible markup language (XML)”, [online] World Wide Web Consortium Recommendation REC-xml-19980210, www.w3.org/TR/1998/REC-xml-19980210 (1998, Accessed: 18 May 2014).
[2] T. Berners-Lee, et al., “The Semantic Web”, Scientific American, vol. 284, no. 5, pp. 28-37, 2001.
[3] T. Kudrass, “Coping with semantics in XML document management”, in Proceedings of the Ninth OOPSLA Workshop on Behavioral Semantics, Northeastern University, 2001, pp. 150-161.
[4] F.B. Foroutan and H. Khotanlou, “Improving semantic clustering using with Ontology and rules”, International Journal of Electrical and Computer Engineering (IJECE), vol. 4, no. 1, pp. 7-15, 2014.
[5] K.B. Fard, et al., “Recommender system based on semantic similarity”, International Journal of Electrical and Computer Engineering (IJECE), vol. 3, no. 6, pp. 751-761, 2013.
[6] M. Klein, “XML, RDF, and relatives”, IEEE Intelligent Systems, vol. 16, no. 2, pp. 26-28, 2001.
[7] O. Lassila and R. Swick, “Resource Description Framework (RDF) Model and Syntax Specification”, [online] W3C Recommendation, 1999; http://www.w3.org/TR/REC-rdf-syntax/ (Accessed: 24 June 2014).
[8] A. Tagarelli and S. Greco, “Semantic clustering of XML documents”, ACM Transactions on Information Systems (TOIS), vol. 28, no. 1, article 3, 2010.
[9] J.W. Lee, et al., “Preparations for semantics-based XML mining”, in Proceedings of IEEE International Conference on Data Mining, ICDM, 2001, pp. 345-352.
[10] Q. Wang, et al., “Exploiting semantic tags in XML retrieval”, in Focused Retrieval and Evaluation, S. Geva, et al., Eds. Berlin: Springer Berlin Heidelberg, 2010, pp. 133-144.
[11] D. Buscaldi, et al., “Tag semantics for the retrieval of XML documents”, in Proceedings of the 1st international symposium on Information and communication technologies (ISICT '03), Dublin, Ireland, 2003, pp. 273-278.
[12] R. Cover, “XML and Semantic Transparency”, Technology Reports, 1998, Available at: http://xml.coverpages.org/xmlAndSemantics.html, (Accessed 8 October 2014).
[13] H. Bohring and S. Auer, “Mapping XML to OWL Ontologies”, Leipziger Informatik-Tage, vol. 72, pp. 147-156, 2005.
[14] V. Gancheva, “XML to RDF Scientific Data Transformation”, in Proceedings of the 5th European Computing Conference (ECC '11), Paris, France, 2011, pp. 354-357.
[15] D. Van Deursen, et al., “XML to RDF conversion: a generic approach”, in Proceedings of International Conference on Automated solutions for Cross Media Content and Multi-channel Distribution, AXMEDIS'08, Florence, Italia, 2008, pp. 138-144.
[16] T. Rodrigues, et al., “Mapping XML to Existing OWL ontologies”, in Proceedings of International Conference WWW/Internet, 2006, pp. 72-77.
[17] S. Liu, et al., “XSDL: Making xml semantics explicit”, in Semantic Web and Databases, C. Bussler, et al., Eds. Berlin: Springer Berlin Heidelberg, 2005, pp. 64-83.
[18] Y. Chen, et al., “Expression of XML Implicit Semantics”, in Proceedings of The Second International Symposium on Networking and Network Security (ISNNS 2010), Jinggangshan, China, 2010, pp. 15-19.
[19] G. Hignette, et al., “Fuzzy semantic annotation of xml documents”, in Proceedings of the CAiSE'05 WORKSHOPS, The 17th conference on advanced information systems engineering, DisWeb'05, Porto, Portugal, 2005, pp. 319-332.
[20] F. Goasdoué, et al., “Growing triples on trees: an XML-RDF hybrid model for annotated documents”, The VLDB Journal, vol. 22, no. 5, pp. 589-613, 2013.
[21] Y. Kotb, et al., “XML Semantics”, in A. Scime, Ed, Web Mining: Applications and Techniques. Hershey, PA: Idea Group Publishing, 2005, pp. 169-188.
[22] A. Renear, et al., “Towards a semantics for XML markup”, in Proceedings of the 2002 ACM symposium on Document engineering, DocEng '02, McLean, VA, USA, 2002, pp. 119-126.
[23] N. Aussenac-Gilles and M. Kamel, “Ontology Learning by Analyzing XML Document Structure and Content”, in Proceedings of the International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, KEOD, Madeira, Portugal, 2009, pp. 159-165.
[24] Y.Q. Yang, et al., “An automatic semantic extraction algorithm for XML document”, in Proceedings of International Conference on Machine Vision and Human-Machine Interface (MVHI), Kaifeng, China, 2010, pp. 41-44.
[25] L. Li, et al., “Discovering semantics from data-centric XML”, in H. Decker, et al., Eds, Database and Expert Systems Applications. Springer Berlin Heidelberg, 2013, pp. 88-102.
[26] S. Yang, et al., “Derivation of OWL Ontology from XML Documents by Formal Semantic Modeling”, Journal of Computers, vol. 8, no. 2, 2013.
BIOGRAPHIES OF AUTHORS
Mohammad Moradi received his B.S. in Software Engineering from Ghazali Higher Education Institute, Qazvin, Iran. Currently, he is pursuing an M.S. in Software Engineering at Islamic Azad University, Qazvin Branch, Qazvin, Iran. He is interested in Semantic Web and Web 2.0 related topics as well as social networks and data mining.
Mohammad Reza Keyvanpour is an Assistant Professor at Alzahra University, Tehran, Iran. He received his B.S. in Software Engineering from Iran University of Science & Technology, Tehran, Iran. He received his M.S. and Ph.D. in Software Engineering from Tarbiat Modares University, Tehran, Iran. His research interests include image retrieval and data mining.