Please use this identifier to cite or link to this item: http://localhost/handle/Hannan/973
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhang, Wei Emma. ; (orcid)http://orcid.org/0000-0002-0406-5974. ;en_US
dc.contributor.authorSheng, Quan Z. ;en_US
dc.date.accessioned2013en_US
dc.date.accessioned2020-05-17T08:24:04Z-
dc.date.available2020-05-17T08:24:04Z-
dc.date.issued2018en_US
dc.identifier.isbn9783319949352 ;en_US
dc.identifier.isbn9783319949345 (print) ;en_US
dc.identifier.urihttp://localhost/handle/Hannan/973-
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionSpringerLink (Online service) ;en_US
dc.descriptionPrinted edition: ; 9783319949345. ;en_US
dc.descriptionen_US
dc.descriptionen_US
dc.description.abstractIn this book, the authors first address the research issues by providing a motivating scenario, followed by the exploration of the principles and techniques of the challenging topics. Then they solve the raised research issues by developing a series of methodologies. More specifically, the authors study the query optimization and tackle the query performance prediction for knowledge retrieval. They also handle unstructured data processing, data clustering for knowledge extraction. To optimize the queries issued through interfaces against knowledge bases, the authors propose a cache-based optimization layer between consumers and the querying interface to facilitate the querying and solve the latency issue. The cache depends on a novel learning method that considers the querying patterns from individualees historical queries without having knowledge of the backing systems of the knowledge base. To predict the query performance for appropriate query scheduling, the authors examine the queriesee structural and syntactical features and apply multiple widely adopted prediction models. Their feature modelling approach eschews the knowledge requirement on both the querying languages and system. To extract knowledge from unstructured Web sources, the authors examine two kinds of Web sources containing unstructured data: the source code from Web repositories and the posts in programming question-answering communities. They use natural language processing techniques to pre-process the source codes and obtain the natural language elements. Then they apply traditional knowledge extraction techniques to extract knowledge. For the data from programming question-answering communities, the authors make the attempt towards building programming knowledge base by starting with paraphrase identification problems and develop novel features to accurately identify duplicate posts. For domain specific knowledge extraction, the authors propose to use clustering technique to separate knowledge into different groups. They focus on developing a new clustering algorithm that uses manifold constraint in the optimization task and achieves fast and accurate performance. For each model and approach presented in this dissertation, the authors have conducted extensive experiments to evaluate it using either public dataset or synthetic data they generated. ;en_US
dc.description.statementofresponsibilityby Wei Emma Zhang, Quan Z. Sheng.en_US
dc.description.tableofcontents1 Introduction -- 2 Cache Based Optimization for Querying Curated Knowledge Bases -- 3 Query Performance Prediction on Knowledge Base -- 4 An Efficient Knowledge Clustering Algorithm -- 5 Knowledge Extraction from Unstructured Data on the Web -- 6 Building Knowledge Bases from Unstructured Data on the Web -- 7 Conclusion. ;en_US
dc.format.extentXIII, 139 p. 41 illus., 32 illus. in color. ; online resource. ;en_US
dc.publisherSpringer International Publishing :en_US
dc.publisherImprint: Springer,en_US
dc.relation.haspart9783319949345.pdfen_US
dc.subjectComputer Scienceen_US
dc.subjectData Miningen_US
dc.subjectInformation Storage and Retrievalen_US
dc.subjectComputer Scienceen_US
dc.subjectData Mining and Knowledge Discoveryen_US
dc.subjectInformation Storage and Retrieval. ;en_US
dc.subjectInformation Systems Applicationsen_US
dc.subject.ddc006.312 ; 23 ;en_US
dc.subject.lccQA76.9.D343 ;en_US
dc.titleManaging Data From Knowledge Bases: Querying and Extractionen_US
dc.typeBooken_US
dc.publisher.placeCham :en_US
Appears in Collections:مدیریت فناوری اطلاعات

Files in This Item:
File Description SizeFormat 
9783319949345.pdf3.86 MBAdobe PDFThumbnail
Preview File
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhang, Wei Emma. ; (orcid)http://orcid.org/0000-0002-0406-5974. ;en_US
dc.contributor.authorSheng, Quan Z. ;en_US
dc.date.accessioned2013en_US
dc.date.accessioned2020-05-17T08:24:04Z-
dc.date.available2020-05-17T08:24:04Z-
dc.date.issued2018en_US
dc.identifier.isbn9783319949352 ;en_US
dc.identifier.isbn9783319949345 (print) ;en_US
dc.identifier.urihttp://localhost/handle/Hannan/973-
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionSpringerLink (Online service) ;en_US
dc.descriptionPrinted edition: ; 9783319949345. ;en_US
dc.descriptionen_US
dc.descriptionen_US
dc.description.abstractIn this book, the authors first address the research issues by providing a motivating scenario, followed by the exploration of the principles and techniques of the challenging topics. Then they solve the raised research issues by developing a series of methodologies. More specifically, the authors study the query optimization and tackle the query performance prediction for knowledge retrieval. They also handle unstructured data processing, data clustering for knowledge extraction. To optimize the queries issued through interfaces against knowledge bases, the authors propose a cache-based optimization layer between consumers and the querying interface to facilitate the querying and solve the latency issue. The cache depends on a novel learning method that considers the querying patterns from individualees historical queries without having knowledge of the backing systems of the knowledge base. To predict the query performance for appropriate query scheduling, the authors examine the queriesee structural and syntactical features and apply multiple widely adopted prediction models. Their feature modelling approach eschews the knowledge requirement on both the querying languages and system. To extract knowledge from unstructured Web sources, the authors examine two kinds of Web sources containing unstructured data: the source code from Web repositories and the posts in programming question-answering communities. They use natural language processing techniques to pre-process the source codes and obtain the natural language elements. Then they apply traditional knowledge extraction techniques to extract knowledge. For the data from programming question-answering communities, the authors make the attempt towards building programming knowledge base by starting with paraphrase identification problems and develop novel features to accurately identify duplicate posts. For domain specific knowledge extraction, the authors propose to use clustering technique to separate knowledge into different groups. They focus on developing a new clustering algorithm that uses manifold constraint in the optimization task and achieves fast and accurate performance. For each model and approach presented in this dissertation, the authors have conducted extensive experiments to evaluate it using either public dataset or synthetic data they generated. ;en_US
dc.description.statementofresponsibilityby Wei Emma Zhang, Quan Z. Sheng.en_US
dc.description.tableofcontents1 Introduction -- 2 Cache Based Optimization for Querying Curated Knowledge Bases -- 3 Query Performance Prediction on Knowledge Base -- 4 An Efficient Knowledge Clustering Algorithm -- 5 Knowledge Extraction from Unstructured Data on the Web -- 6 Building Knowledge Bases from Unstructured Data on the Web -- 7 Conclusion. ;en_US
dc.format.extentXIII, 139 p. 41 illus., 32 illus. in color. ; online resource. ;en_US
dc.publisherSpringer International Publishing :en_US
dc.publisherImprint: Springer,en_US
dc.relation.haspart9783319949345.pdfen_US
dc.subjectComputer Scienceen_US
dc.subjectData Miningen_US
dc.subjectInformation Storage and Retrievalen_US
dc.subjectComputer Scienceen_US
dc.subjectData Mining and Knowledge Discoveryen_US
dc.subjectInformation Storage and Retrieval. ;en_US
dc.subjectInformation Systems Applicationsen_US
dc.subject.ddc006.312 ; 23 ;en_US
dc.subject.lccQA76.9.D343 ;en_US
dc.titleManaging Data From Knowledge Bases: Querying and Extractionen_US
dc.typeBooken_US
dc.publisher.placeCham :en_US
Appears in Collections:مدیریت فناوری اطلاعات

Files in This Item:
File Description SizeFormat 
9783319949345.pdf3.86 MBAdobe PDFThumbnail
Preview File
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhang, Wei Emma. ; (orcid)http://orcid.org/0000-0002-0406-5974. ;en_US
dc.contributor.authorSheng, Quan Z. ;en_US
dc.date.accessioned2013en_US
dc.date.accessioned2020-05-17T08:24:04Z-
dc.date.available2020-05-17T08:24:04Z-
dc.date.issued2018en_US
dc.identifier.isbn9783319949352 ;en_US
dc.identifier.isbn9783319949345 (print) ;en_US
dc.identifier.urihttp://localhost/handle/Hannan/973-
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionen_US
dc.descriptionSpringerLink (Online service) ;en_US
dc.descriptionPrinted edition: ; 9783319949345. ;en_US
dc.descriptionen_US
dc.descriptionen_US
dc.description.abstractIn this book, the authors first address the research issues by providing a motivating scenario, followed by the exploration of the principles and techniques of the challenging topics. Then they solve the raised research issues by developing a series of methodologies. More specifically, the authors study the query optimization and tackle the query performance prediction for knowledge retrieval. They also handle unstructured data processing, data clustering for knowledge extraction. To optimize the queries issued through interfaces against knowledge bases, the authors propose a cache-based optimization layer between consumers and the querying interface to facilitate the querying and solve the latency issue. The cache depends on a novel learning method that considers the querying patterns from individualees historical queries without having knowledge of the backing systems of the knowledge base. To predict the query performance for appropriate query scheduling, the authors examine the queriesee structural and syntactical features and apply multiple widely adopted prediction models. Their feature modelling approach eschews the knowledge requirement on both the querying languages and system. To extract knowledge from unstructured Web sources, the authors examine two kinds of Web sources containing unstructured data: the source code from Web repositories and the posts in programming question-answering communities. They use natural language processing techniques to pre-process the source codes and obtain the natural language elements. Then they apply traditional knowledge extraction techniques to extract knowledge. For the data from programming question-answering communities, the authors make the attempt towards building programming knowledge base by starting with paraphrase identification problems and develop novel features to accurately identify duplicate posts. For domain specific knowledge extraction, the authors propose to use clustering technique to separate knowledge into different groups. They focus on developing a new clustering algorithm that uses manifold constraint in the optimization task and achieves fast and accurate performance. For each model and approach presented in this dissertation, the authors have conducted extensive experiments to evaluate it using either public dataset or synthetic data they generated. ;en_US
dc.description.statementofresponsibilityby Wei Emma Zhang, Quan Z. Sheng.en_US
dc.description.tableofcontents1 Introduction -- 2 Cache Based Optimization for Querying Curated Knowledge Bases -- 3 Query Performance Prediction on Knowledge Base -- 4 An Efficient Knowledge Clustering Algorithm -- 5 Knowledge Extraction from Unstructured Data on the Web -- 6 Building Knowledge Bases from Unstructured Data on the Web -- 7 Conclusion. ;en_US
dc.format.extentXIII, 139 p. 41 illus., 32 illus. in color. ; online resource. ;en_US
dc.publisherSpringer International Publishing :en_US
dc.publisherImprint: Springer,en_US
dc.relation.haspart9783319949345.pdfen_US
dc.subjectComputer Scienceen_US
dc.subjectData Miningen_US
dc.subjectInformation Storage and Retrievalen_US
dc.subjectComputer Scienceen_US
dc.subjectData Mining and Knowledge Discoveryen_US
dc.subjectInformation Storage and Retrieval. ;en_US
dc.subjectInformation Systems Applicationsen_US
dc.subject.ddc006.312 ; 23 ;en_US
dc.subject.lccQA76.9.D343 ;en_US
dc.titleManaging Data From Knowledge Bases: Querying and Extractionen_US
dc.typeBooken_US
dc.publisher.placeCham :en_US
Appears in Collections:مدیریت فناوری اطلاعات

Files in This Item:
File Description SizeFormat 
9783319949345.pdf3.86 MBAdobe PDFThumbnail
Preview File