Education Curriculum Vitae (2008)
– The major research topics include Web information retrieval and data mining, search engine log analysis, automatic information organization, text classification, clustering algorithms, and...
Automatic Query Taxonomy Generation for Information Retrieval Applications (2008)
Shui-lung Chuang, Lee-feng Chien
Abstract. It is crucial for information retrieval systems to learn more about what users search for in order to fulfill the intent of search. This paper introduces a new problem, called query...
Topic Hierarchy Generation for Text Segments: A Practical Web-based Approach (2008)
documents and queries from users, into a well-formed topic hierarchy. In this paper, we address the problem of generating topic hierarchies for diverse text segments with a general and practical...
Collaborative Wrapping: A Turbo Framework for Web Data Extraction (2008)
Shui-lung Chuang, Chengxiang Zhai
To access data sources on the Web, a crucial step is wrapping, which translates query responses, rendered in textual HTML, back into their relational form. Traditionally, this problem has been...
Context-aware wrapping: Synchronized data extraction (2007)
The deep Web presents a pressing need for integrating large numbers of dynamically evolving data sources. To be more automatic yet accurate in building an integration system, we observe two problems:...
Context-aware wrapping: Synchronized data extraction (2007)
The deep Web presents a pressing need for integrating large numbers of dynamically evolving data sources. To be more automatic yet accurate in building an integration system, we observe two problems:...
LiveClassifier: Creating Hierarchical Text Classifiers through Web Corpora (2004)
Huang, Chien-Chung, Chuang, Shui-Lung, Chien, Lee-Feng
Many Web information services utilize techniques of information extraction (IE) to collect important facts from the Web. To create more advanced services, one possible method is to discover thematic...
Using a web-based categorization approach to generate thematic metadata form texts (2004)
Chien-chung Huang, Shui-lung Chuang, Lee-feng Chien
Conventional tools for automatic metadata creation mostly extract named entities or text segments from texts and annotate them with information about persons, locations, dates, and so on. However,...