Prof Mo Saraee M.Saraee@salford.ac.uk
Professor
XML data has become very popular to represent semi structured data. This has resulted in a growing amount of XML data on the web. This raises a need for languages and tools to manage collections of XML documents as well as to mine interesting information from them. Several attempts at developing XML mining techniques have been proposed. However the topic of mining XML data has received little attention as the data mining community has focused on the development of techniques for extracting common structure from heterogeneous XML data. This project aims to data mine XML data using the XML Query language XQuery. The data mining technique used is the clustering technique of the Nearest Neighbour Algorithm.
This algorithm will be incorporated into XQuery expression which, when implemented using an XQuery implementation tool, will cluster distance based data within the XML document into groups, where the distance between the data is set by a given threshold.
The implementation of the Nearest Neighbour algorithm hopes to be generic and implement a user interface which allows the user to load a XML document for its data to be clustered, choose the data to be clustered within that document, input the threshold and receive the clustered result in an output file. This work would allow XML distance data to be clustered with the Nearest Neighbour algorithm using XQuery, therefore providing a needed data mining implementation on XML data.
Presentation Conference Type | Other |
---|---|
Conference Name | DMIN |
Start Date | Jun 20, 2005 |
End Date | Jun 23, 2005 |
Publication Date | Jun 21, 2005 |
Deposit Date | Oct 26, 2011 |
Publicly Available Date | Oct 26, 2011 |
Related Public URLs | http://dblp.uni-trier.de/db/conf/dmin/dmin2005.html |
Additional Information | Event Type : Conference |
MSaraee_DMIN05_D.pdf
(67 Kb)
PDF
Features in extractive supervised single-document summarization: case of Persian news
(2024)
Journal Article
Deriving Environmental Risk Profiles for Autonomous Vehicles From Simulated Trips
(2023)
Journal Article
DeepClean : a robust deep learning technique for autonomous vehicle camera data privacy
(2022)
Journal Article
Machine learning-based optimized link state routing protocol for D2D communication in 5G/B5G
(2022)
Presentation / Conference
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search