A vision for petabyte data management and analysis services for the Arecibo telescope (2009)
Manuel Calimlim, Jim Cordes, Alan Demers, Julia Deneva, Johannes Gehrke, Dan Kifer, ...
We survey the initial steps of a project to build a data management and data mining system for astronomy data generated by the Arecibo Telescope. The total amount of data that our project will have...
ABSTRACT Large-Scale Collaborative Analysis and Extraction of Web Data (2009)
Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Manuel Calimlim
Archived web data is a great resource for scientific research, but poses serious challenges in data processing and management. We demonstrate the Web Lab Collaboration Server, a platform and service...
A vision for petabyte data management and analysis services for the Arecibo telescope (2008)
Manuel Calimlim, Jim Cordes, Alan Demers, Julia Deneva, Johannes Gehrke, Dan Kifer, ...
We survey the initial steps of a project to build a data management and data mining system for astronomy data generated by the Arecibo Telescope. The total amount of data that our project will have...
Three Case Studies of Large-Scale Data Flows (2008)
William Y. Arms, Selcuk Aya, Manuel Calimlim, Johannes Gehrke, Dave Lifka, Mirek Riedewald, ...
We survey three examples of large-scale scientific workflows that we are working with at Cornell: the Arecibo sky survey, the CLEO high-energy particle physics experiment, and the Web Lab project for...
MAFIA: A maximal frequent itemset algorithm for transactional databases (2001)
Doug Burdick, Manuel Calimlim, Johannes Gehrke
We present a new algorithm for mining maximal frequent itemsets from a transactional database. Our algorithm is especially efficient when the itemsets in the database are very long. The search...