International Journal of Electrical and Computer Engineering (IJECE)
Vol. 5, No. 4, August 2015, pp. 840–848
ISSN: 2088-8708
A Comparative Study: Change Detection and Querying Dynamic XML Documents
Vijay Sonawane, D. R. Rao
Department of Computer Engineering, K.L. University, Green Fields, Guntur, Andhra Pradesh
Article Info
Article history:
Received Feb 22, 2015
Revised May 14, 2015
Accepted May 29, 2015
Keywords: Change Detection, Delta, Dynamic, Querying, XML
ABSTRACT
The efficient management of dynamic XML documents is a complex area of research. The changes to an XML document, and its size, over its lifetime are unbounded. The growing number of applications that use and exchange XML data is creating a demand for multi-version support and for storing XML documents for future reference. Change detection, which identifies the differences between successive versions of a document, is an important part of version management. Document content evolves continuously. Users want to be able to query previous versions, query changes in documents, and retrieve a particular document version efficiently. In this paper we provide a comprehensive comparative analysis of various control schemes for change detection and querying of dynamic XML documents.
Copyright © 2015 Institute of Advanced Engineering and Science. All rights reserved.
Corresponding Author:
Vijay Sonawane
Department of Computer Engineering, K.L. University, Green Fields, Guntur, Andhra Pradesh, 522 502
Email: vijaysonawane11@gmail.com
1. INTRODUCTION
In today's information era, XML has rapidly become the language of choice for data representation, storage, and exchange over the web. Information available online is useful not only to individual users but also to business organizations, mostly for decision making. XML supports many business functions, including content integration and intelligence. It is therefore very important to maintain these documents properly and to know how to use the information in them efficiently.
1.1. Background
Web browsers use HyperText Markup Language (HTML) to compose textual, graphical, audible, and visual material in the form of Web pages. HTML is limited by its fixed set of elements for representing Web content and its presentation. To overcome these limitations, W3C designed XML (Extensible Markup Language), which is simpler and self-describing. Its format is similar to that of HTML but more flexible; it is also used for data exchange and supports external data validation. An XML Schema or Document Type Definition (DTD) defines the structure of XML documents and helps applications validate imported data. Several standards are available for data sharing and management in XML, such as Extensible Stylesheet Language (XSL) to transform an XML document and Document Object Model (DOM) to allow programs to dynamically access or update an XML document [1].
XML and database technology are closely related. The task of managing large amounts of XML data has increased the demand to integrate XML with database technology. XML databases are equally beneficial for content-centric and change-centric applications. Change-centric applications such as software configuration and cooperative work may exploit dynamic XML documents, since the content and structure of a dynamic XML document change continuously depending on the underlying application, for example the content of an e-commerce web site. In contrast, XML documents whose structure remains unchanged are called static XML documents. Whenever the content of an XML document changes, two things mainly happen: (a) the document size grows, and (b) a new version of the same document is created. A version control scheme may use object referencing or deltas to detect and store the changes between multiple versions. Most of the time, only small changes in the document create the new version.
The authors of [2, 3, 4, 5] have written on evolving semi-structured or XML data and schemas. An overview of schema evolution support in several commercial tools is provided in [6].
1.2. Motivation
An organization can publish its product information online and allow customers to provide reviews and suggestions by browsing and querying the catalogue. XML promises to support such business functions, including content integration, intelligence, and salvage. Efficient maintenance and utilization of the information contained in XML documents is therefore very important. XML is often called self-describing data: it is designed in such a way that the schema is repeated for each record in the document. XML is used to create vocabularies for storing and exchanging metadata on the internet. The demand to store XML documents for future reference has increased as the number of applications that use and exchange XML data has increased. Importantly, XML documents are dynamic in nature, as their content is application specific and changes over time.
1.3. The Problem
Dynamic XML documents are applicable in many fields of information management, which creates a demand for multi-version document support. It is therefore necessary to store different versions of XML documents over time. Storing every version of an XML document in full is not an effective solution, as it increases duplication and makes searching and querying harder as the document grows. So it is essential to find techniques to detect and store the changes in multi-version XML documents. It is also important to find schemes to efficiently execute cross-version queries over dynamic XML documents. In this paper, we discuss the two main issues related to dynamic XML documents:
(i) Change detection in dynamic XML documents
(ii) Querying dynamic XML documents
2. DYNAMIC XML DOCUMENTS
XML documents are mainly of two kinds: static XML documents and dynamic XML documents. In real-world applications, changes from one version of an XML document to another cannot be predicted, as those changes are application specific. Version control is required to manage and query the semi-structured nature of continuously changing XML documents. Researchers have so far proposed various schemes for version control based on two main approaches: the delta-based approach [7, 8, 9, 10, 11, 12, 13, 14, 15] and object referencing [16, 17, 18, 19]. A delta document records the changes between two consecutive versions; using a delta, one can easily locate the previous version, so the delta approach has been considered superior to the object referencing approach. A web management system records multi-version XML documents as they change over time. Multiple efforts have been made toward the effective management of growing XML documents [20, 21, 22].
3. CHANGE DETECTION
Change detection is the process of identifying differences between successive versions of a document. The concept of change detection was first applied to relational databases in distributed environments [23]. An algorithm was proposed in [24] to detect all changes to the base table in a distributed relational database system. Beyond relational data, the success of XML as the standard for data exchange has paved the way for a number of change detection techniques and attracted research attention. Change detection helps to reduce the storage space needed for historical data by avoiding redundancy, and it supports temporal queries. Temporal queries are time specific, and their results change with the creation, modification, and deletion of data. Change detection is therefore important for mining the historical data of documents to get detailed information on the changes made since their creation. Most online data changes at a high rate, and those changes need to be monitored effectively. Change detection also helps to improve incremental query evaluation, in which continuous queries monitor a particular data source and its updates, and so reduces the cost of query evaluation. Rather than evaluating a query on the entire data, it is beneficial to combine the result of the query on the changed data with the previous query result [25].
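The incremental evaluation idea described above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that a delta is available as a list of inserted items and a list of deleted items, and that the query is a simple selection predicate; the function name `incremental_select` and the sample data are hypothetical and do not come from any of the surveyed schemes.

```python
from typing import Callable, Iterable, Set

def incremental_select(previous_result: Set[str],
                       inserted: Iterable[str],
                       deleted: Iterable[str],
                       predicate: Callable[[str], bool]) -> Set[str]:
    """Update a selection result from a delta instead of rescanning all data."""
    result = set(previous_result)
    # Deleted items can no longer be part of the answer.
    result -= set(deleted)
    # Only the newly inserted items need to be tested against the query.
    result |= {item for item in inserted if predicate(item)}
    return result

# Hypothetical data: element titles in version 1, then a delta leading to version 2.
def is_xml_title(title: str) -> bool:
    return "XML" in title

v1_titles = {"XML Basics", "SQL Primer", "XML Schema"}
result_v1 = {t for t in v1_titles if is_xml_title(t)}   # full evaluation, done once

result_v2 = incremental_select(result_v1,
                               inserted=["XML Deltas", "Graph Theory"],
                               deleted=["XML Basics"],
                               predicate=is_xml_title)
print(sorted(result_v2))
```

The second evaluation touches only the three items mentioned in the delta, which is the cost saving the paragraph above refers to.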
Fig. 1 shows three versions of an XML document. The underlined parts of version 1 and version 2 mark the changes between them; otherwise these two versions are alike. Version 3 shows an alternative version of the same XML document with considerably more change. This version holds the same information, but arranged under a different schema. Intuitively, because of these structural changes, version 3 requires a larger number of addition and deletion operations, even though it contains the same information as its ancestor version.
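To make the effect of structural change concrete, the following Python sketch counts additions and deletions between versions. It is a deliberately simplified comparison, not the algorithm of any surveyed proposal: each version is flattened into a set of (path, text) pairs, so repeated identical siblings are collapsed, and the three sample documents are hypothetical stand-ins for the versions in Fig. 1.

```python
import xml.etree.ElementTree as ET
from typing import Set, Tuple

def flatten(xml_text: str) -> Set[Tuple[str, str]]:
    """Return the set of (path, text) pairs for every element in the document."""
    root = ET.fromstring(xml_text)
    pairs: Set[Tuple[str, str]] = set()

    def walk(node: ET.Element, prefix: str) -> None:
        path = f"{prefix}/{node.tag}"
        pairs.add((path, (node.text or "").strip()))
        for child in node:
            walk(child, path)

    walk(root, "")
    return pairs

def edit_cost(old_xml: str, new_xml: str) -> Tuple[int, int]:
    """Return (additions, deletions) needed to go from old_xml to new_xml."""
    old, new = flatten(old_xml), flatten(new_xml)
    return len(new - old), len(old - new)

# Hypothetical versions: v2 changes one value, v3 keeps the data but restructures it.
v1 = "<book><title>XML</title><price>10</price></book>"
v2 = "<book><title>XML</title><price>12</price></book>"
v3 = "<item><info><title>XML</title></info><cost>12</cost></item>"

print(edit_cost(v1, v2))  # small content change
print(edit_cost(v2, v3))  # same information, different structure
```

Under this simplified measure the restructured version costs several times more operations than the content-only change, even though no information was added or lost.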
Figure 1. Consecutive versions of an XML document
4. DETECTING CHANGES IN DYNAMIC XML DOCUMENTS
Version detection has many applications, such as web document ranking, duplicate code identification, plagiarism detection, and online shop management. Similarities between documents are used to detect versions; for this purpose, the choice of the similarity function and its threshold value is an important issue. Content-based and structure-based similarity, as well as application-specific requirements, are useful in selecting a similarity function for version discovery.
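As an illustration of how content-based and structure-based similarity might be combined with a threshold, consider the Python sketch below. The Jaccard measure, the equal weighting, and the threshold of 0.6 are assumptions chosen for the example only; the surveyed proposals define their own similarity functions, and all names here are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Set, Tuple

def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

def features(xml_text: str) -> Tuple[Set[str], Set[str]]:
    """Extract structural features (tag names) and content features (words)."""
    root = ET.fromstring(xml_text)
    tags: Set[str] = set()
    words: Set[str] = set()
    for element in root.iter():
        tags.add(element.tag)
        words.update((element.text or "").split())
    return tags, words

def similarity(xml_a: str, xml_b: str, structure_weight: float = 0.5) -> float:
    """Weighted mix of structure-based and content-based similarity."""
    tags_a, words_a = features(xml_a)
    tags_b, words_b = features(xml_b)
    return (structure_weight * jaccard(tags_a, tags_b)
            + (1.0 - structure_weight) * jaccard(words_a, words_b))

def is_version_of(xml_a: str, xml_b: str, threshold: float = 0.6) -> bool:
    """Treat two documents as versions of each other above the threshold."""
    return similarity(xml_a, xml_b) >= threshold

# Hypothetical documents.
doc_a = "<doc><title>XML change detection</title></doc>"
doc_b = "<doc><title>XML change detection survey</title></doc>"
doc_c = "<list><item>apples</item></list>"

print(similarity(doc_a, doc_b), is_version_of(doc_a, doc_b))
print(similarity(doc_a, doc_c), is_version_of(doc_a, doc_c))
```

Changing `structure_weight` or `threshold` changes which documents are classified as versions, which is why the choice of both is treated as an open issue.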
Table 1 shows various proposals for discovering changes in dynamic XML documents.

Table 1. Various schemes for change discovery in dynamic XML documents
Approach | Proposed by author
Diff-based schemes | COB 02, RUS 08, SAC 07, WAN 03, WON 02, WON 03
Changes in a document's textual content | BAE 11
Changes in document structure | FLE 06
Key based | VIY 09
Document classification | PON 07, WAN 03
Diff-based schemes may use a delta document or a consolidated delta document for version management. A delta document records only the changes between two versions of an XML document, whereas a consolidated delta document contains the data of the original version as well as all deltas. Moreover, a delta document can be categorized as a forward or a backward delta. The author of [26] provides a more efficient consolidated delta, as it provides a mapping between forward and backward deltas and reduces the storage space needed for intermediate deltas. A consolidated delta is a single document that stores all the changes between all versions of an XML document, rather than storing intermediate deltas separately. If a document has multiple versions at different times, then the version at any moment T can be obtained using either intermediate deltas or the consolidated delta.
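[Figure 2, two panels. Intermediate Delta: versions T1, T2, T3, ..., Tn-2, Tn-1, Tn linked by Delta1, Delta2, ..., Delta n-2, Delta n-1, with one version marked "We are here" and another marked "We need it". Consolidated Delta: the same versions, with one marked "We are here" and another marked "We need it, we can get it from the consolidated delta directly".]
Figure 2. Intermediate delta and Consolidated delta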
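The difference between the two panels of Figure 2 can be sketched in Python as follows. This is a simplified model under stated assumptions, not the scheme of [26]: a version is represented as a flat mapping from element path to text, a forward delta maps each changed path to its new value (or `None` for a deletion), and the consolidated structure keeps a version-stamped history per element. All function names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Version = Dict[str, str]              # element path -> text value
Delta = Dict[str, Optional[str]]      # path -> new value, or None for a deletion
History = Dict[str, List[Tuple[int, Optional[str]]]]

def diff(old: Version, new: Version) -> Delta:
    """Forward delta: what must change to turn `old` into `new`."""
    delta: Delta = {p: v for p, v in new.items() if old.get(p) != v}
    delta.update({p: None for p in old if p not in new})
    return delta

def apply_delta(version: Version, delta: Delta) -> Version:
    """Apply one forward delta to a version."""
    result = dict(version)
    for path, value in delta.items():
        if value is None:
            result.pop(path, None)
        else:
            result[path] = value
    return result

def rebuild_from_intermediate(first: Version, deltas: List[Delta], k: int) -> Version:
    """Replay deltas 1..k-1 on T1 to obtain Tk (k is 1-based)."""
    current = first
    for delta in deltas[:k - 1]:
        current = apply_delta(current, delta)
    return current

def consolidate(first: Version, deltas: List[Delta]) -> History:
    """One structure holding, per element, its version-stamped values."""
    history: History = {p: [(1, v)] for p, v in first.items()}
    for stamp, delta in enumerate(deltas, start=2):
        for path, value in delta.items():
            history.setdefault(path, []).append((stamp, value))
    return history

def rebuild_from_consolidated(history: History, k: int) -> Version:
    """Read Tk directly: take each element's latest value stamped <= k."""
    version: Version = {}
    for path, changes in history.items():
        value = None
        for stamp, stamped_value in changes:   # changes are in ascending stamp order
            if stamp <= k:
                value = stamped_value
        if value is not None:
            version[path] = value
    return version

# Hypothetical three-version catalogue entry.
t1 = {"/cat/item/name": "pen", "/cat/item/price": "10"}
t2 = {"/cat/item/name": "pen", "/cat/item/price": "12"}
t3 = {"/cat/item/name": "pen", "/cat/item/price": "12", "/cat/item/stock": "5"}

deltas = [diff(t1, t2), diff(t2, t3)]
history = consolidate(t1, deltas)
```

With intermediate deltas, reaching version k requires replaying a chain of deltas; with the consolidated structure, any version is read in a single pass without applying the intermediate steps.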
The consolidated delta approach has advantages over the intermediate delta approach, with all three of its drawbacks, because the consolidated delta stores only the changed part of each version rather than storing each version in full every time. If a document is
unchanged, it is not treated as a new version. The approach has the drawback of a time overhead when querying backward for the unchanged parts of a document.
Storing intermediate complete versions of XML documents improves efficiency and space complexity, as the required version can be created from the appropriate intermediate complete version instead of from the current version only. Query processing becomes efficient when a system stores intermediate complete versions, because there is no need to reconstruct them.
The schema-unaware approaches use a fixed schema to store the logical order of an XML document and provide two benefits: (1) they handle XML schema evolution, and (2) change detection does not require any modification of SQL queries. The reference-based schemes use object references and preserve the logical order of dynamic XML documents. A version is represented using objects, which can (1) reference records representing the parts unchanged between two versions or (2) be the actual document object records.
A well-formed XML document satisfies certain constraints. Because updates to XML documents are application specific, the updated parts need to be tracked. Thus validation of time-varying multi-version XML documents is an important research area [27]. Storing all the versions of highly similar XML documents introduces redundancy. The objective is to reduce this redundancy without loss of significant information. Storage redundancy can be avoided by storing only once the elements that remain unchanged between multiple versions.
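Storing unchanged elements only once can be illustrated with a small content-addressed store in Python. This is a sketch under assumptions, not the design of any reference-based scheme cited here: elements are treated as opaque strings, a SHA-256 digest of the content serves as the reference, and the class name `ReferenceStore` is hypothetical.

```python
import hashlib
from typing import Dict, List

class ReferenceStore:
    """Keeps each distinct element once; versions are ordered lists of references."""

    def __init__(self) -> None:
        self.records: Dict[str, str] = {}     # reference key -> element content
        self.versions: List[List[str]] = []   # one list of keys per version

    def add_version(self, elements: List[str]) -> int:
        """Store a version and return its 1-based version number."""
        keys = []
        for element in elements:
            key = hashlib.sha256(element.encode("utf-8")).hexdigest()
            # An element already stored by an earlier version is not duplicated.
            self.records.setdefault(key, element)
            keys.append(key)
        self.versions.append(keys)
        return len(self.versions)

    def get_version(self, number: int) -> List[str]:
        """Rebuild a version, in its original element order, from the references."""
        return [self.records[key] for key in self.versions[number - 1]]

# Hypothetical versions sharing one unchanged element.
store = ReferenceStore()
v1 = ["<title>XML</title>", "<price>10</price>"]
v2 = ["<title>XML</title>", "<price>12</price>"]
store.add_version(v1)
store.add_version(v2)
print(len(store.records))  # number of distinct elements actually stored
```

Two versions with four elements in total occupy three stored records, because the unchanged title is shared; the per-version key lists preserve the logical order of each version.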
Table 2 presents a comparative study of change management schemes across several parameters: change discovery between two versions of an XML document, scalability, affordable file size, use of a relational approach (schema aware/unaware), the delta approach, the object-referencing approach for change discovery, and redundancy. Each parameter has its own importance. The schemes presented in [8, 10, 11, 12] can explicitly show the changed parts of the document to the user. The schemes presented in [28, 29, 30, 31, 32, 33, 34] do not explicitly show the changed part of the document to the user. It is beneficial to store intermediate complete versions of an XML document, which improves efficiency and space complexity. The required version can then be created from the appropriate intermediate complete version instead of from the current version only. Storing intermediate complete versions also improves querying speed, as intermediate versions need not be reconstructed. In schema-unaware approaches, a fixed schema is used to store the logical order of an XML document. This gives the benefit of handling XML schema evolution and change discovery without any modification of SQL queries. The object-referencing approach to change discovery uses object references and maintains the logical order of multi-version XML documents. Objects are used to represent versions and can reference the parts unchanged between two versions.
5. QUERY EXECUTION ON DYNAMIC XML DOCUMENTS
Queries associated with dynamic XML documents are of two types: change centric and content centric. Change-centric query execution may cover a single version or multiple versions of a document. Content-centric query execution covers a single version, querying the edit operations performed over a single document version. Queries covering multiple document versions are executed over multiple deltas, and their results are combined to produce the final answer set. The queries can be categorized into different types as follows:
1. Structural Projection Queries: Queries such as "project the document's 15th version between the 5th and the 10th chapter" require a contiguous portion of the document from a given version.
2. Path-Expression Queries: Queries such as "for the 15th version, find all table elements under chapter 6" take a version and a regular path expression and output the elements that satisfy the expression in the specified version.
3. Parent-Child Expression Queries: Queries such as "find all table elements under chapters in the 15th version" combine structural joins with a path expression.
4. Content-Based Selection Queries: These output the versions that satisfy the predicates given in the where clause.
5. Version-Interval Queries: These queries extend single-version queries to multiple versions.
6. Version Aggregation Queries: These queries find aggregate values of elements that are specified by a version-interval query.
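Three of the query types listed above (path-expression, version-interval, and version aggregation) can be sketched with Python's standard `xml.etree.ElementTree` module. The sketch assumes, for simplicity, that every version is available as a complete XML string keyed by version number; a delta-based store would first have to reconstruct the versions. The function names and the sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List

def path_query(versions: Dict[int, str], version: int, path: str) -> List[ET.Element]:
    """Path-expression query: elements matching `path` in one given version."""
    root = ET.fromstring(versions[version])
    return root.findall(path)

def interval_query(versions: Dict[int, str], first: int, last: int,
                   path: str) -> Dict[int, List[ET.Element]]:
    """Version-interval query: the same path query over versions first..last."""
    return {n: path_query(versions, n, path)
            for n in range(first, last + 1) if n in versions}

def aggregate_count(versions: Dict[int, str], first: int, last: int,
                    path: str) -> Dict[int, int]:
    """Version aggregation query: count of matching elements per version."""
    matches = interval_query(versions, first, last, path)
    return {n: len(elements) for n, elements in matches.items()}

# Hypothetical two-version document; chapter 6 gains a table in version 2.
versions = {
    1: "<book><chapter n='6'><table/></chapter></book>",
    2: "<book><chapter n='6'><table/><table/></chapter></book>",
}
tables_in_chapter_6 = "./chapter[@n='6']/table"

print(len(path_query(versions, 2, tables_in_chapter_6)))
print(aggregate_count(versions, 1, 2, tables_in_chapter_6))
```

The path syntax used here is the limited XPath subset that `ElementTree.findall` supports; richer path expressions would need a full XPath or XQuery engine.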
Table 2. Various change management schemes: Comparative Analysis
Column groups: Change detection (columns 1-3), Change management approach (columns 4-8), Ordered XML document (column 9), Redundancy (column 10)
Proposal | Change detection | Scalable | Affordable file size | Relational approach (schema aware/unaware) | Find intermediate deltas | Store consolidated deltas | Store intermediate complete versions | Reference based | Ordered XML document | Redundancy
[AMA 00] | N | na | na | N | N | N | N | N | N | N
[CHA 01] | Y | Y | - | N | N | N | N | Y | Y | N
[COB 02] | Y | N | 10Mb | N | Y | N | N | Y | Y | N
[WON 02] | Y | N | - | N | N | N | N | Y | Y | N
[GER 03] | N | na | na | na | N | N | N | N | N | N
[WAN 03a] | Y | Y | - | N | N | N | Y | N | Y | N
[WAN 03] | Y | N | LT 10mb | N | N | N | N | N | N | N
[WON 03] | Y | N | - | N | Y | N | Y | N | Y | N
[WUW 04] | Y | N | - | N | Y | N | N | N | N | N
[GRA 05] | N | na | na | N | N | N | N | N | N | N
[LEO 05] | Y | Y | na | aware | Y | N | N | N | N | N
[LEO 05] | Y | Y | - | unaware | Y | N | N | N | N | N
[RUS 05] | Y | Y | - | N | N | Y | N | N | N | N
[GUE 07] | N | na | na | N | N | N | N | N | N | N
[ROS 07] | N | na | na | N | N | N | N | N | Y | N
[GUE 08] | N | na | na | N | N | N | N | N | N | N
[RIZ 08] | N | na | na | na | na | N | N | N | N | N
[SNO 08] | N | na | na | N | N | N | N | N | N | N
[WAN 08] | N | na | na | N | N | N | N | N | N | N
[ZHO 09] | N | na | na | N | N | N | N | N | N | N
[BRA 10] | N | na | na | N | N | N | N | N | N | N
[CAV 10] | N | na | na | N | N | N | N | N | N | N
[CAV 11] | N | na | na | N | N | N | N | Y | N | N
[FED 13] | N | na | na | N | N | N | N | N | N | N
Note: How to read na and -? If a proposal fails to detect changes, the scalable and affordable file size parameters are not meaningful, and their values are represented by na. A - in the affordable file size column indicates that the proposal does not discuss affordable file size.
These queries are difficult to answer efficiently. They require identification of the parent-child relationships between document elements, and they can be answered efficiently by avoiding a full traversal of all document subtrees. Methods based on line edit scripts or object reference schemes are not effective at capturing the logical order and parent-child relationships of a document in versioned XML documents. The authors of [35, 36, 37, 38] proposed numbering schemes that are used to capture the logical order of documents. Various numbering schemes are available, such as range based [35, 36], durable node [37], and prefix based [40, 39].
Range Based: In this numbering scheme, a range of numbers is associated with each node to identify its position in the XML tree. It reduces path expression queries to join operations.
Durable Numbering: With this scheme, the insertion and deletion of elements does not affect the numbers associated with the other elements, so it maintains the nodes in preorder traversal.
Prefix-Based Numbering: In this scheme, the number of the parent node is used as a prefix when assigning a number to a child node. Because it uses variable-length encoding, it needs more space than the other schemes.
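The range-based and prefix-based schemes described above can be sketched in Python as follows. These are textbook-style simplifications rather than the exact labelings of [35, 36, 39, 40]: the range labels are (start, end) counters assigned during a depth-first traversal, the prefix labels are tuples of child positions, and all function names are hypothetical. Durable numbering, which additionally leaves gaps so that labels survive updates, is not shown.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Tuple

RangeLabel = Tuple[int, int]
PrefixLabel = Tuple[int, ...]

def range_labels(root: ET.Element) -> Dict[ET.Element, RangeLabel]:
    """Assign each node a (start, end) range that encloses its descendants' ranges."""
    labels: Dict[ET.Element, RangeLabel] = {}
    counter = 0

    def visit(node: ET.Element) -> None:
        nonlocal counter
        counter += 1
        start = counter
        for child in node:
            visit(child)
        counter += 1
        labels[node] = (start, counter)

    visit(root)
    return labels

def is_ancestor_by_range(a: RangeLabel, d: RangeLabel) -> bool:
    """Ancestor test is a range containment check, usable as a join condition."""
    return a[0] < d[0] and d[1] < a[1]

def prefix_labels(root: ET.Element) -> Dict[ET.Element, PrefixLabel]:
    """Assign each child its parent's label extended by the child's position."""
    labels: Dict[ET.Element, PrefixLabel] = {}

    def visit(node: ET.Element, label: PrefixLabel) -> None:
        labels[node] = label
        for position, child in enumerate(node, start=1):
            visit(child, label + (position,))

    visit(root, (1,))
    return labels

def is_ancestor_by_prefix(a: PrefixLabel, d: PrefixLabel) -> bool:
    """Ancestor test is a proper-prefix check; labels grow with tree depth."""
    return len(a) < len(d) and d[:len(a)] == a

# Hypothetical document: a book with two chapters, the first containing a table.
book = ET.fromstring("<book><chapter><table/></chapter><chapter/></book>")
chapter1, chapter2 = list(book)
table = chapter1[0]

ranges = range_labels(book)
prefixes = prefix_labels(book)
print(ranges[book], ranges[chapter1], ranges[table], ranges[chapter2])
print(prefixes[book], prefixes[chapter1], prefixes[table], prefixes[chapter2])
```

Both tests answer "is the table under this chapter?" from the labels alone, without walking the tree, which is why such labelings let path expressions be evaluated as joins. The prefix labels lengthen with depth, which reflects the extra space cost noted above.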
Table 3 gives a comparative analysis of various schemes that efficiently support queries on XML documents, considering the query types diff, parent-child expression, path expression, version interval, version aggregation, structural projection, and content-based selection. The effectiveness of any scheme depends on how well it supports powerful queries. The authors of [33] describe support for powerful temporal queries expressed in XQuery for multi-version XML documents without introducing new constructs in the language. Similarly, [31] discusses support for temporal queries expressed in XPath and XSLT for multi-version XML documents without introducing new constructs in the language. Schema evolution is time specific, so it is difficult even to formulate the correct query. Research in the area of dynamic XML documents has not explored the independence property, as these systems must have strong knowledge of the schema. The authors of [41] introduce the advantages of design independence and the potential of statistical methods that have this independence property.
6. FUTURE RESEARCH DIRECTION
There are still some open research problems for dynamic XML documents. Here are some of the issues we identified:
1. In the change detection problem, finding the similarity function (a formula to measure the similarity between files) and its threshold value is very important. Also, when detecting versions, many other features (rather than only content and structure) must be considered together. Moreover, the similarity function for version detection should not be application specific. Hence, defining an effective similarity function for detecting the changes between XML documents is a challenging research area.
2. The self-describing nature of XML introduces the problem of verbosity and results in large document sizes. An XML delta is a document that records the changes between two versions of an XML document. It is possible to compress evolving XML documents and find deltas without decompressing them. Delta document compression may save storage and computing power, so finding a compression technique that supports time-efficient query processing without delta decompression needs further research.
7. CONCLUSION
Data available online today is often in the form of XML documents, to ease the maintenance and management of important information. The massive storage of data in XML documents needs efficient data management schemes. Changes in XML documents are application specific and evolve over time. Many applications require querying earlier versions of an XML document. Storing all versions of a document increases redundancy. Unlike static XML documents, the unpredictable and continuously changing nature of dynamic XML documents requires different management schemes. This paper discusses several issues related to dynamic XML documents, including change management between versions of XML documents using deltas, object references, or relational approaches, version support, and trend analysis queries. Lastly, we identified some active research directions in the area of dynamic XML documents.
Table 3. Various querying support schemes: Comparative Analysis
Column groups: Proposal; Complex; Simple; Temporal; Query Type
Sub-columns: For dynamic XML documents; Supported; For dynamic XML documents; Diff; Parent child expression; Path expression; Version interval; Version aggregation; Structural projection; Content based selection
[AMA 00]: N Y Y N
[LIQ 01]: N Y N N N N N N N N N
[MAR 01]: N
[CHA 01]: Y Y Y Y N Y N Y Y N
[BRU 02]: N Y Y N N N N N N N
[CHA 02]: Y N N Y N Y Y Y Y N
[COB 02]: N
[WON 02]: N N Y Y
[VAG 03]: Y N Y N N N N Y Y N
[WAN 03]: N N N Y N N Y Y Y Y Y
[WUW 04]: N
[GRA 05]: N
[ROS 07]: N N N N Y Y N N Y Y Y
[WAN 08]: N Y N
[ZHO 09]: N
[BRA 10]: N
[CAV 10]: N
[CAV 11]: N
Note: How to read the table?
1: [AMA 00, WON 02, WAN 08] - Do not provide support for complex queries for either static XML documents or dynamic XML documents.
2: [AMA 00] - Provides support for simple queries and temporal queries.
3: [LIQ 01, WAN 08] - Provide support for complex temporal queries for dynamic XML documents. However, they do not provide support for temporal queries for dynamic XML documents.
4: [COB 02, MAR 01, WUW 04, ZHO 09, BRA 10, CAV 10, CAV 11] - Do not provide any type of query support.
REFERENCES
[1] [ELH 14] Elham B. F., Hasan K., "Improving semantic clustering using with Ontology and rules", International Journal of Electrical and Computer Engineering (IJECE), Vol. 4, No. 1, pp. 7-15, Feb. 2014.
[2] [ALI 09] Ali, K. A., and Pokorný, J., Comparison of XML-based temporal models, Advanced Internet Based Systems and Applications, LNCS, 4879, pp. 339-350, 2009.
[3] [COL 10] Colazzo, D., Guerrini, G., Mesiti, M., Oliboni, B., and Waller, E., Document and schema XML updates, IGI Global. http://dx.doi.org/10.4018/978-1-61520-727-5.ch016, 2010.
[4] [GUE 09] Guerrini, G., and Mesiti, M., XML schema evolution and versioning: Current approaches and future trends, Open and novel issues in XML database applications, future directions and advanced technologies, IDEA Group, 2009.
[5] [HAR 11] Hartung, M., Terwilliger, J., and Rahm, E., Recent advances in schema and ontology evolution, Schema matching and mapping, Springer, pp. 149-190, 2011.
[6] [GUE 08] Guerrini, G., and Mesiti, M., X-Evolution: A comprehensive approach for XML schema evolution, Proceedings of the international workshop on database and expert systems application, pp. 251-255, 2008.
[7] [CHA 01b] Chien, S. Y., Tsotras, V. J., and Zaniolo, C., Copy-based versus edit-based version management schemes for structured documents, Proceedings of the eleventh international workshop on research issues in data engineering, pp. 95-102, 2001.
[8] [LEO 05] Leonardi, E., and Bhowmick, S. S., Detecting changes on unordered XML documents using relational databases: A schema conscious approach, Proceedings of the ACM conference on information and knowledge management, pp. 509-516, 2005.
[9] [MAD 05] Madria, S., Leonardi, E., and Bhowmick, S. S., Xandy: Detecting changes on large unordered XML documents using relational databases, Proceedings of the international conference on database systems for advanced applications, pp. 711-723, 2005.
[10] [RUS 05] Rusu, L. I., Rahayu, W., Taniar, D., Maintaining versions of dynamic XML documents, Proceedings of the sixth international conference on web information systems engineering, pp. 536-543, 2005.
[11] [WON 02] Wong, R. K., Lam, N., Managing and querying multi-version XML data with update logging, Proceedings of the ACM symposium on document engineering, pp. 74-81, 2002.
[12] [WUW 04] Wuwongse, V., Yoshikawa, M., Amagasa, T., Temporal versioning of XML documents, Proceedings of the Seventh International conference on digital libraries: International collaboration and cross-fertilization, pp. 419-428, 2004.
[13] [WAN 03a] Wang, F., Zaniolo, C., Temporal queries in XML document archives and web warehouses, Proceedings of the tenth international symposium on temporal representation and reasoning and fourth international conference on temporal logic, pp. 47-55, 2003(a).
[14] [MAR 01] Marian, A., Abiteboul, S., Cobena, G., Mignet, L., Change-centric management of versions in an XML warehouse, Proceedings of the twenty seventh international conference on very large data base, pp. 581-590, 2001.
[15] [WON 03] Wong, R. K., Lam, N., Efficient re-construction of document versions based on adaptive forward and backward change deltas, In Proceedings of the international conference on database and expert systems applications, pp. 266-275, 2003.
[16] [CHA 01] Chien, S. Y., Tsotras, V. J., Zaniolo, C., and Zhang, D., Storing and querying multiversion XML documents using durable node numbers, Proceedings of the second international conference on web information systems engineering, pp. 232-241, 2001.
[17] [CHA 01a] Chien, S. Y., Tsotras, V. J., and Zaniolo, C., Efficient management of multiversion documents by object referencing, Proceedings of the twenty seventh international conference on very large data bases, pp. 291-300, 2001.
[18] [CHA 01c] Chien, S. Y., Tsotras, V. J., and Zaniolo, C., XML document versioning, ACM SIGMOD Records, pp. 46-53, 2001.
[19] [CHA 02] Chien, S. Y., Tsotras, V. J., and Zaniolo, C., Efficient schemes for managing multiversion XML document, Very Large Data Bases, pp. 332-353, 2002.
[20] [GER 03] Gergatsoulis, M., and Stavrakas, Y., Representing changes in XML documents using dimensions, Proceedings of the international XML database symposium, pp. 208-221, 2003.
[21] [GRA 05] Grandi, F., Mandreoli, F., and Tiberio, P., Temporal modelling and management of normative documents in XML format, Data and Knowledge Engineering, 54(3), pp. 327-354, 2005.
[22] [GRA 03] Grandi, F., Mandreoli, F., Tiberio, P., and Bergonzini, M., A temporal data model and management system for normative texts in xml format, Proceedings of the international workshop on web information and data management, pp. 29-36, 2003.
[23] [SAM 12] Samini S., Su-Cheng H., Poo Kuan H., "Bridging XML and Relational Databases: An Effective Mapping Scheme based on Persistent", International Journal of Electrical and Computer Engineering (IJECE), Vol. 2, No. 2, pp. 239-246, April 2012.
[24] [LIN 86] Lindsay, B. G., Haas, L. M., Mohan, C., Pirahesh, H., and Wilms, P. F., A Snapshot Differential Refresh Algorithm, Proc. ACM SIGMOD International Conference on Management of Data, Washington, D.C., U.S.A., pp. 53-60, 1986.
[25] [FAB 04] Fabio Grandi, Introducing an Annotated Bibliography on Temporal and Evolution Aspects in the World Wide Web, SIGMOD Record, Volume 33, Number 2, June 2004.
[26] [RUS 08] Rusu, L. I., Rahayu, W., Taniar, D., Storage techniques for multi-versioned XML documents, Proceedings of the thirteenth international conference on database systems for advanced applications, pp. 538-545, 2008.
[27] [BRA 10] Brahmia, Z., Bouaziz, R., Grandi, F., and Oliboni, B., Schema versioning in τXSchema-based multitemporal XML repositories, Tech. Rep. TR-93, TimeCenter, 2010.
[28] [CAV 10] Cavalieri, F., EXup: an engine for the evolution of XML schemas and associated documents, Proceedings of the international conference on extending database technology, pp. 1-10, 2010.
[29] [CAV 11] Cavalieri, F., Guerrini, G., Mesiti, M., and Oliboni, B., On the reduction of sequences of XML document and schema update operations, Proceedings of the IEEE twenty seventh international conference on data engineering workshops, pp. 77-86, 2011.
[30] [GUE 07] Guerrini, G., Mesiti, M., and Sorrenti, M. A., XML schema evolution: Incremental validation and efficient document adaptation, Proceedings of the international symposium on XML database, pp. 92-106, 2007.
[31] [ROS 07] Rosado, L. A., Márquez, A. P., Gil, J. M., Managing branch versioning in versioned/temporal XML documents.
[32] [SNO 08] Snodgrass, R. T., Dyreson, C., Currim, F., Currim, S., Joshi, S., Validating quicksand: Temporal schema versioning in τXSchema, Data and Knowledge Engineering, 65(2), 223-242, 2008.
[33] [WAN 08] Wang, F., Zaniolo, C., Temporal queries and version management for XML document archives, Data and Knowledge Engineering, 65(2), 304-324, 2008.
[34] [ZHO 09] Zholudev, V., Kohlhase, M., TNTBase: A versioned storage for XML, Balisage: The Markup Conference, Proceedings of the international symposium on XML database, pp. 107-121, 2009.
[35] [ABI 01] Abiteboul, S., Kaplan, H., Milo, T., Compact labeling schemes for ancestor queries, Proceedings of the twelfth ACM-SIAM symposium on discrete algorithms, pp. 547-556, 2001.
[36] [BRU 02] Bruno, N., Koudas, N., and Srivastava, D., Holistic twig joins: optimal XML pattern matching, Proceedings of the ACM SIGMOD international conference on management of data, pp. 948-952, 2002.
[37] [LIQ 01] Li, Q., Moon, B., Indexing and querying XML data for regular path expressions, Proceedings of the twenty seventh international conference on very large data base, pp. 361-370, 2001.
[38] [WAN 03] Wang, H., Park, S., Fan, W., Yu, P., ViST: a dynamic index method for querying XML data by tree structures, Proceedings of the ACM SIGMOD international conference on management of data, pp. 110-121, 2003.
[39] [SAN 08] Sans, V., Laurent, D., Prefix based numbering schemes for XML: Techniques, applications and performances, Proceedings of the VLDB Endowment, 1(2), 1564-1573, 2008.
[40] [DYE 11] Dyreson, C. E., and Mekala, K. G., Prefix-based node numbering for temporal XML, Proceedings of the international conference on web information systems engineering, pp. 172-184, 2011.
[41] [TER 11] Termehchy, A., Winslett, M., Chodpathumwan, Y., How Schema Independent are Schema Free Query Interfaces?, Proceedings of the international conference on data engineering, pp. 649-660, 2011.
BIOGRAPHY OF AUTHORS
Vijay Sonawane
Vijay Sonawane obtained a Bachelor's degree in Information Technology from North Maharashtra University (MS), India. He then earned a Master of Technology (Computer Science) from Shivaji University, Kolhapur (MS), India. He is currently a PhD research scholar in computer science and engineering at K.L. University, Vijayawada. He is working as an Assistant Professor at Sandip Institute of Technology and Research Centre, Nashik. His major areas of interest are data mining, information retrieval, and web information management. He has published various papers in journals and conferences.