Fengjun Li, Bo Luo, Peng Liu, Dongwon Lee, Prasenjit Mitra, Wang-chien Lee, ...
Abstract—An XML brokerage system is a distributed XML database system that comprises data sources and brokers which, respectively, hold XML documents and document distribution information....
Prasenjit Mitra, Advisor Professor, Gio Wiederhold
Databases, with an emphasis on designing techniques and tools to enable integration and interoperation of data and knowledge over the Internet and across enterprises. In particular: schema matching,...
Ying Liu, Lucian V. Lita, R. Stefan Niculescu, Prasenjit Mitra, C. Lee Giles
Since nearly all information is now created digitally, large text databases have become more prevalent than ever. Automatically mining information from these databases proves to be a challenge due to...
Effectively Searching Maps in Web Documents (2009)
Tan, Qingzhao, Mitra, Prasenjit, Giles, C. Lee
Maps are an important source of information in archaeology and other sciences. Users want to search for historical maps to determine recorded history of the political geography of regions at...
Topic Segmentation with Shared Topic Detection and Alignment of Multiple Documents (2008)
Bingjun Sun, Prasenjit Mitra, Hongyuan Zha, C. Lee Giles, John Yen
Topic detection and tracking [26] and topic segmentation [15] play an important role in capturing the local and sequential information of documents. Previous work in this area usually focuses on...
Processing Transitive Nearest-Neighbor Queries in Multi-Channel Access Environments (2008)
Xiao Zhang, Wang-chien Lee, Prasenjit Mitra, Baihua Zheng
Wireless broadcast is an efficient way for information dissemination due to its good scalability [10]. Existing works typically assume mobile devices, such as cell phones and PDAs, can access only...
Identifying Value Mappings for Data Integration: An Unsupervised Approach (2008)
Jaewoo Kang, Dongwon Lee, Prasenjit Mitra
Abstract. The Web is a distributed network of information sources where the individual sources are autonomously created and maintained. Consequently, syntactic and semantic heterogeneity of data...
Privacy-preserving Semantic Interoperation of Heterogeneous Databases (2008)
Peng Liu, Prasenjit Mitra, Chi-chun Pan
Two major challenges to enabling secure interoperation among web-information sources are resolving semantic heterogeneity across websites and maintaining the privacy of the data and metadata of...
Automatic Identification and Data Extraction from 2-Dimensional Plots in Digital Documents (2008)
Brouwer, William, Kataria, Saurabh, Das, Sujatha, Mitra, Prasenjit, Giles, C. L.
Most search engines index the textual content of documents in digital libraries. However, scholarly articles frequently report important findings in figures for visual impact and the contents of...
ChemXSeer: A Web Search Engine and Repository for e-Chemistry (2008)
C. Lee Giles, Prasenjit Mitra, Karl Mueller, James Z. Wang, Bingjun Sun, Levent Bolelli, ...
Cyberinfrastructure or e-science has become crucial for scientific progress and open source systems have greatly facilitated design and implementation. In chemistry, the growth of data has been...
ChemXSeer: An eChemistry Web Search Engine and Repository (2008)
C. Lee Giles, Prasenjit Mitra, Karl Mueller, James Z. Wang, Bingjun Sun, Levent Bolelli, ...
With the amount of scientific digital content available online constantly increasing, the community of science has been increasing its efforts towards automatically collecting and organizing such...
Extraction and Search of Chemical Formulae in Text Documents on the Web (2007)
Bingjun Sun, Qingzhao Tan, Prasenjit Mitra, C. Lee Giles
Often scientists seek to search for articles on the Web related to a particular chemical. When a scientist searches for a chemical formula using a search engine today, she gets articles where the...
HICCUP: Hierarchical Clustering Based Value Imputation using Heterogeneous Gene Expression (2007)
Qiankun Zhao, Prasenjit Mitra, Dongwon Lee, Jaewoo Kang
A novel microarray value imputation method, HICCUP, is presented. HICCUP improves upon existing value imputation methods in several ways. (1) By judiciously integrating heterogeneous microarray...
Are Your Citations Clean? (2007)
Dongwon Lee, Jaewoo Kang, Prasenjit Mitra, C. Lee Giles, Byung-Won On
If they are, only one can refer to a distinct document; if not, many can refer to the same document.
Deriving Knowledge from Figures for Digital Libraries (2007)
Xiaonan Lu, James Z. Wang, Prasenjit Mitra, C. Lee Giles
Figures in digital documents contain important information. Current digital libraries do not summarize and index information available within figures for document retrieval. We present our system on...
Automatic Extraction of Data from 2-D Plots in Documents (2007)
Xiaonan Lu, James Z. Wang, Prasenjit Mitra, C. Lee Giles
Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D...
TableSeer: Automatic table metadata extraction and searching in digital libraries (2007)
Ying Liu, Kun Bai, Prasenjit Mitra, C. Lee Giles
Tables are ubiquitous in digital libraries. In scientific documents, tables are widely used to present experimental results or statistical data in a condensed fashion. However, current search engines...
Privacy-preserving semantic interoperation and access control of heterogeneous databases (2006)
Prasenjit Mitra, Chi-chun Pan, Peng Liu
Today, many applications require users from one organization to access data belonging to other organizations. While traditional solutions offered for the federated and mediated databases facilitate this by...
Automatic extraction of table metadata from digital documents (2006)
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and highlight a collection...
Anuj R. Jaiswal, C. Lee Giles, Prasenjit Mitra, James Z. Wang
Increasingly, scientists are seeking to collaborate and share data among themselves. Such sharing can be readily done by publishing data on the World-Wide Web. Meaningful querying and searching on...
Privacy-preserving semantic interoperation and access control of heterogeneous databases (2006)
Prasenjit Mitra, Chi-chun Pan, Peng Liu
Today, many applications require users from one organization to access data belonging to other organizations. While traditional solutions offered for the federated and mediated databases facilitate...
Ontology mapping discovery with uncertainty (2005)
Prasenjit Mitra, Natalya F. Noy, Anuj R. Jaiswal
Abstract. Resolving semantic heterogeneity among information sources is a central problem in information interoperation, information integration, and information sharing among websites. Ontologies...
Comparative Study of Name Disambiguation Problem using a Scalable Blocking-based Framework (2005)
Byung-won On, Dongwon Lee, Jaewoo Kang, Prasenjit Mitra
In this paper, we consider the problem of ambiguous author names in bibliographic citations, and comparatively study alternative approaches to identify and correct such name variants (e.g.,...
Automatic identification of informative sections of web pages (2005)
Sandip Debnath, Prasenjit Mitra, Nirmal Pal, C. Lee Giles
Web-pages – especially dynamically generated ones – contain several items that cannot be classified as the “primary content”, e.g., navigation sidebars, advertisements, copyright notices,...
Automatic extraction of informative blocks from webpages (2005)
Sandip Debnath, Prasenjit Mitra, C. Lee Giles
Search engines crawl and index webpages depending upon their informative content. However, webpages — especially dynamically generated ones — contain items that cannot be classified as the...
Automatic identification of informative sections of web pages (2005)
Sandip Debnath, Prasenjit Mitra, Nirmal Pal, C. Lee Giles
Abstract. Web pages, especially dynamically generated ones, contain several items that cannot be classified as the “primary content,” e.g., navigation sidebars, advertisements, copyright...
Identifying content blocks from web documents (2005)
Sandip Debnath, Prasenjit Mitra, C. Lee Giles
Abstract. Intelligent information processing systems, such as digital libraries or search engines index web-pages according to their informative content. However, web-pages contain several...
An Empirical Analysis of Development Processes for Anticipatory Standards (2005)
Prasenjit Mitra, Sandeep Purao, John W. Bagby, Karthikeyan Umapathy, Sharoda Paul, ...
, is a non-profit institution devoted to research on network industries, electronic commerce, telecommunications, the Internet, “virtual networks” comprised of computers that share the same...
RNAi-based analysis of CAP, Cbl, and CrkII function in the regulation of GLUT4 by insulin (2004)
Mitra, Prasenjit, Zheng, Xuexiu, Czech, Michael P.
Stimulation of glucose transport by insulin in cultured adipocytes through translocation of intracellular GLUT4 glucose transporters to the plasma membrane has been suggested to require...
A novel phosphatidylinositol(3,4,5)P3 pathway in fission yeast (2004)
Mitra, Prasenjit, Zhang, Yingjie, Rameh, Lucia E., Ivshina, Mariya P., McCollum, Dannel, Nunnari, John J., ...
The mammalian tumor suppressor, phosphatase and tensin homologue deleted on chromosome 10 (PTEN), inhibits cell growth and survival by dephosphorylating phosphatidylinositol-(3,4,5)-trisphosphate...
An algebraic framework for the interoperation of ontologies (2004)
Mitra, Prasenjit, Wiederhold, Gio (Advisor)
Submitted to the Department of Electrical Engineering.
On Containment of Conjunctive Queries with Arithmetic Comparisons (2004)
Foto Afrati, Chen Li, Prasenjit Mitra
We study the following problem: how to test if Q2 is contained in Q1, where Q1 and Q2 are conjunctive queries with arithmetic comparisons? This problem is fundamental in a large variety of database...
OMEN: A Probabilistic Ontology Mapping Tool (2004)
Prasenjit Mitra, Natalya F. Noy, Anuj R. Jaiswal
Abstract. Most existing ontology mapping tools do not provide exact mappings. Rather, there is usually some degree of uncertainty. We describe a framework to improve existing ontology mappings using...
Answering queries using views with arithmetic comparisons (2002)
Foto Afrati, Chen Li, Prasenjit Mitra
We consider the problem of answering queries using views, where queries and views are conjunctive queries with arithmetic comparisons (CQACs) over dense orders. Previous work only considered limited...
Rewriting queries using views in the presence of arithmetic comparisons (2002)
Foto Afrati, Chen Li, Prasenjit Mitra
We consider the problem of answering queries using views, where queries and views are conjunctive queries with arithmetic comparisons over dense orders. Previous work only considered limited variants...
Mitra, Prasenjit, Vaughan, Patricia S., Stein, Janet L., Stein, Gary S., Van Wijnen, Andre J.
Regulation of histone gene transcription at the G1/S phase transition via the Site II cell cycle control element is distinct from E2F-dependent mechanisms operative at the growth factor-related...
An algebra for semantic interoperability of information sources (2001)
Prasenjit Mitra, Gio Wiederhold
Resolving heterogeneity among the various biological information systems is a crucial problem if we wish to gain value from the many distributed resources available to us. For example, information...
An algorithm for answering queries efficiently using views (2001)
Algorithms for answering queries using views have been used to integrate information from multiple sources. The bucket algorithm, predominantly used to reformulate queries, has two drawbacks. It...
A Scalable Framework for the Interoperation of Information Sources (2001)
Prasenjit Mitra, Gio Wiederhold, Stefan Decker
Resolving heterogeneity among information systems is a crucial necessity if we wish to gain value from the many distributed resources available to us. Problems of heterogeneity in hardware, operating...
A role for nuclear PTEN in neuronal differentiation (2000)
Lachyankar, Mahesh B., Sultana, Nazneen, Schonhoff, Christopher M., Mitra, Prasenjit, Poluha, Wojciech, Lambert, Stephen, ...
Mutations of phosphatase and tensin homolog deleted on chromosome 10 (PTEN), a protein and lipid phosphatase, have been associated with gliomas, macrocephaly, and mental deficiencies. We have...
An information food chain for advanced applications on the www (2000)
Stefan Decker, Jan Jannink, Prasenjit Mitra, Gio Wiederhold, Steffen Staab, Rudi Studer
The Internet and especially the World Wide Web are growing at a tremendous rate. More and more information is becoming directly available for human consumption. But humans have a limited information...
A Graph-Oriented Model for Articulation of Ontology Interdependencies (2000)
Prasenjit Mitra, Gio Wiederhold, Martin Kersten
Ontologies explicate the contents, essential properties, and relationships between terms in a knowledge base. Many sources are now accessible with associated ontologies. Most prior work on the use of...
An algebra for semantic interoperation of semistructured data (1999)
Jan Jannink, Prasenjit Mitra, Erich Neuhold, Srinivasan Pichai, Rudi Studer, Gio Wiederhold
The diversity and availability of information sources on the World Wide Web has set the stage for integration and reuse at an unparalleled scale. There remain obstacles to exploiting the extent of...
An algebra for semantic interoperation of semistructured data (1999)
Prasenjit Mitra, Erich Neuhold, Srinivasan Pichai, Rudi Studer, Gio Wiederhold
The diversity and availability of information sources on the World Wide Web has set the stage for integration and reuse at an unparalleled scale. There remain significant hurdles to exploiting the...
Semi-automatic Integration of Knowledge Sources (1999)
Prasenjit Mitra, Gio Wiederhold, Jan Jannink
Integration of knowledge from multiple independent sources presents problems due to their semantic heterogeneity. Careful handling of semantics is important for reliable interaction with autonomous...
An Algebra for Semantic Interoperation of Semistructured Data (1999)
Jan Jannink, Prasenjit Mitra, Erich Neuhold, Srinivasan Pichai, Rudi Studer, Gio Wiederhold
The diversity and availability of information sources on the World Wide Web has set the stage for integration and reuse at an unparalleled scale. There remain significant hurdles to exploiting the...
Fast Collective Communication Libraries, Please (1995)
Prasenjit Mitra, David G. Payne, Lance Shuler, Robert Geijn, Jerrell Watts
It has been recognized that many parallel numerical algorithms can be effectively implemented by formulating the required communication as collective communications. Nonetheless, the efficiency of...
A novel phosphatidylinositol(3,4,5)P3 pathway in fission yeast
Mitra, Prasenjit, Zhang, Yingjie, Rameh, Lucia E., Ivshina, Maria P., McCollum, Dannel, Nunnari, John J., ...
The mammalian tumor suppressor, phosphatase and tensin homologue deleted on chromosome 10 (PTEN), inhibits cell growth and survival by dephosphorylating phosphatidylinositol-(3,4,5)-trisphosphate...
An Empirical Analysis of Development Processes for Anticipatory Standards
Prasenjit Mitra, Sandeep Purao, John W. Bagby, Karthikeyan Umapathy, Sharoda Paul
There is an evolution in the process used by standards-development organizations (SDOs) and this is changing the prevailing standards development activity (SDA) for information and communications...
Liu, Ying, Lita, Lucian Vlad, Niculescu, Radu Stefan, Mitra, Prasenjit, Giles, C. Lee
Owing to new advances in computer hardware, large text databases have become more prevalent than ever. Automatically mining information from these databases proves to be a challenge due to slow...