Minimax rates of estimation for high-dimensional linear regression over $\ell_q$-balls (2009)
Raskutti, Garvesh, Wainwright, Martin J., Yu, Bin
Consider the standard linear regression model $y = X \beta^* + w$, where $y \in \mathbb{R}^n$ is an observation vector, $X \in \mathbb{R}^{n \times p}$ is a design matrix,...
The composite absolute penalties family for grouped and hierarchical variable selection (2009)
Zhao, Peng, Rocha, Guilherme, Yu, Bin
Extracting useful information from high-dimensional data is an important focus of today's statistical research and practice. Penalized loss function minimization has been shown to be effective for...
Rocha, Guilherme V., Wang, Xing, Yu, Bin
Since its early use in least squares regression problems, the l1-penalization framework for variable selection has been employed in conjunction with a wide range of loss functions encompassing...
Tao Shi, Mikhail Belkin, Bin Yu
This paper focuses on obtaining clustering information in a distribution when iid data are given. First, we develop theoretical results for understanding and using clustering information contained in...
On Model Selection Consistency of the Elastic Net When p >> n (2009)
Abstract: In this paper, we study the model selection property of the Elastic net. In the classical settings when p (the number of predictors) and q (the number of predictors with non-zero...
Sparse Markov source estimation via transformed Lasso (2009)
We establish a connection between Lasso-type L1 regularization and learning variable length Markov chains (VLMCs). This is achieved by a parameterization of discrete-valued finite-memory Markov...
Estimating sparse models from multivariate discrete data via transformed Lasso (2009)
The type of L1 norm regularization used in Lasso and related methods typically yields sparse parameter estimates where most of the estimates are equal to zero. We study a class of estimators obtained...
Computational Prediction of Novel Non-Coding RNAs in Arabidopsis thaliana (2009)
Song, Dandan, Yang, Yang, Yu, Bin, Zheng, Binglian, Deng, Zhidong, Lu, Bao-Liang, ...
Background: Non-coding RNA (ncRNA) genes do not encode proteins but produce functional RNA molecules that play crucial roles in many key biological processes. Recent genome-wide transcriptional...
PGP – Pretty Good Privacy (2008)
_ any two parties can authenticate each other to the extent that they are willing to undertake the transaction. _ Traditional face-to-face transactions _ Electronic commerce. Trust management in...
Traditional approaches to knowledge management are essentially limited to document management. However, much knowledge in organizations or communities resides in an informal social network and may be...
Learning the Quality of Sensor Data in Distributed Decision Fusion (2008)
Abstract: The problem of decision fusion has been studied for distributed sensor systems in the past two decades. Various techniques have been developed for either binary or multiple hypotheses...
Experience with Multi-Camera Tele-Immersive Environment (2008)
Jin Liang, Zhenyu Yang, Bin Yu, Yi Cui, Klara Nahrstedt
With their ability to extract, transfer and render 3D models of real world objects, multi-camera tele-immersive environments are becoming a promising next generation of telecommunication systems. By...
A POMDP Approach to Token-Based Team Coordination (2008)
Yang Xu, Paul Scerri, Bin Yu, Michael Lewis, Katia Sycara
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. Token-based...
Searching Social Networks (2008)
A referral system is a multiagent system whose member agents are capable of giving and following referrals. The specific cases of interest arise where each agent has a user. The agents cooperate by...
Lasso-type recovery of sparse representations for high-dimensional data (2008)
The Lasso is an attractive technique for regularization and variable selection for high-dimensional data, where the number of predictor variables $p_n$ is potentially much larger than the number of...
Incentive mechanisms for agent-based peer-to-peer systems (2008)
Abstract. Most of the existing research in peer-to-peer systems focuses on protocol design and doesn’t consider the rationality of each peer. One phenomenon that should not be ignored is free...
RSSM: A Rough Sets based Service Matchmaking Algorithm (2008)
The past few years have seen the Grid evolving into a service-oriented computing infrastructure. It is envisioned that various resources in a future Grid environment will be exposed as services....
Numerous studies have shown that interpersonal communication acts as an important channel for gathering information. But if we wish to rely on interpersonal communication, we still need to figure out...
Zhenyu Yang, Klara Nahrstedt, Yi Cui, Bin Yu, Jin Liang, Sang-hack Jung, ...
Tele-immersive 3D multi-camera room environments are starting to emerge and with them new challenging research questions. One important question is how to organize the large amount of visual data,...
AVPUC: Automatic Video Production with User Customization (2008)
Nowadays, multiple video cameras are employed for live broadcast and recording of almost all major social events, and all these camera streams are aggregated and rendered into one video program for...
Sam I. Saffer, Van N. Kincaid, Bin Yu
In this paper, we propose a method that combines both web structure and usage mining called conceptual clustering. First, a conceptual cluster repository is created by dynamically classifying and...
Extending the ONESAF Testbed (2008)
Joseph A. Giampapa, Katia Sycara, Sean Owens, Robin Glinton, Young-woo Seo, Bin Yu, ...
Distributed message relaying is an important function of a peer-to-peer system to discover service providers. Existing search protocols in unstructured peer-to-peer systems either create huge burden...
Large Sample Properties of Weighted Monte Carlo Estimators (2008)
Paul Glasserman, Bin Yu
DOI 10.1287/opre.1040.0148, © 2005 INFORMS. A general approach to improving simulation accuracy uses information about auxiliary control variables with known expected values to improve the...
On Model Selection Consistency of Lasso (2008)
Sparsity or parsimony of statistical models is crucial for their proper interpretations, as in sciences and social sciences. Model selection is a commonly used method to find such models, but usually...
Geographic Routing in Distributed Sensor Systems without Location Information (2008)
Abstract: Geographic routing protocols have been widely used in microsensor networks; however, they cannot be directly applied to distributed mobile sensor systems, as mobile sensors often do not know...
Binning in Gaussian Kernel Regularization (2008)
Gaussian kernel regularization is widely used in the machine learning literature and proven successful in many empirical experiments. The periodic version of the Gaussian kernel regularization has...
Challenges in Building Very Large Teams (2008)
Paul Scerri, Yang Xu, Jumpol Polvichai, Bin Yu, Steven Okamoto, Katia Sycara
Coordination of large numbers of unmanned aerial vehicles is difficult due to the limited communication bandwidth available to maintain cohesive activity in a dynamic, often hostile and unpredictable...
Embracing Statistical Challenges in the Information Technology Age (2008)
Information Technology is creating an exciting time for statistics. In this article, we review the diverse sources of IT data in three clusters: IT core, IT systems, and IT fringe. The new data...
Detection of Daytime Arctic Clouds Using MISR and MODIS Data (2008)
Tao Shi, Eugene E. Clothiaux, Bin Yu, Amy J. Braverman, David N. Groff
Amongst the 36 spectral radiances available on the Moderate Resolution Imaging Spectroradiometer (MODIS) seven of them are used operationally for detection of clouds in daytime polar regions. While...
J. Diner, Thomas P. Ackerman, Theodore L. Anderson, Jens Bösenberg, Amy J. Braverman, ...
This paper provides an overview of the PARAGON strategy. Supporting details are contained in a set of four companion papers, the contents of which are summarized below. Seinfeld et al. (2004) discuss...
Daytime Arctic Cloud Detection Based on Multi-angle Satellite Data with Case Studies (2008)
Tao Shi, Bin Yu, Eugene E. Clothiaux, Amy J. Braverman
Global climate models predict that the strongest dependences of surface air temperatures on increasing atmospheric carbon dioxide levels will occur in the northern high latitudes of Alaska,...
Binning in Gaussian Kernel Regularization (2008)
Gaussian kernel regularization is widely used in the machine learning literature and has proved successful in many empirical experiments. The periodic version of Gaussian kernel regularization has...
Comment: Extracting more diagnostic information from a single run using Cusum Path Plot (2008)
Bin Yu is Assistant Professor, Department of Statistics, University of California, Berkeley, CA 94720-3860. Research supported in part by Grant DMS-9322817 from the National Science Foundation and...
A Scalable Overlay Video Mixing Service Model (2008)
In this paper, we have described a distributed video mixing model for delivery of multiple video streams in a single video stream. For the next step, we will be working on (1) a video mixing tree...
Heavy weight of the camouflage materials was always the main problem. To solve it, the low infrared emissivity fibers with the radar absorbing property (LIFR) were prepared. The low infrared...
Festschrift in Honor of Jorma Rissanen on the Occasion of his 75th Birthday (2008)
Grünwald, Peter, Myllymäki, Petri, Tabus, Ioan, Weinberger, Marcelo, Yu, Bin
This Festschrift contains contributions from many well-known information theorists. It was presented to Jorma Rissanen, the founder of the MDL Principle and arithmetic coding, on occasion of his 75th...
Guilherme V. Rocha, Peng Zhao, Bin Yu
Given n observations of a p-dimensional random vector, the covariance matrix and its inverse (precision matrix) are needed in a wide range of applications. Sample covariance (e.g. its eigenstructure)...
High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence (2008)
Pradeep Ravikumar, Garvesh Raskutti, Martin J. Wainwright, Bin Yu
Information In The Non-Stationary Case (2008)
Vincent Q. Vu, Bin Yu, Robert E. Kass
Information estimates such as the “direct method” of Strong et al. (1998) sidestep the difficult problem of estimating the joint distribution of response and stimulus by instead estimating the...
Coding and Compression: a Happy Union of Theory and Practice (2007)
This paper laid down the foundations for what is now known as information theory in a mathematical framework that is probabilistic (see e.g. Cover and Thomas 1991, Verdú 1998). That is, Shannon...
Multiple Copy Image Denoising Via Wavelet Thresholding (2007)
Grace Chang, Bin Yu, Martin Vetterli
This work addresses the recovery of an image from noisy observations when multiple noisy copies of the image are available. The standard method is to compute the average of these copies. Since the...
Lossy Compression and Wavelet Thresholding for Image Denoising (2007)
Grace Chang, Bin Yu, Martin Vetterli
In recent work, it was proposed to use lossy compression to remove noise from corrupted signals, based on the rationale that a reasonable compression method retains the dominant signal features more...
Estimating L¹ Error of Kernel Estimator: Monitoring Convergence of Markov Samplers (2007)
In many Markov chain Monte Carlo problems, the target density function is known up to a normalization constant. In this paper, we take advantage of this knowledge to facilitate the convergence...
Information and the Clone Mapping of Chromosomes (2007)
A clone map of part or all of a chromosome is the result of organizing order and overlap information concerning collections of DNA fragments called clone libraries. In this paper, the expected amount...
MICROARRAY IMAGE COMPRESSION AND THE EFFECT OF COMPRESSION LOSS (2007)
Microarray image technology is a powerful tool for monitoring the expression of thousands of genes simultaneously. The expressed desire of experimentalists to store raw image files due to costly...
Detecting Deception in Reputation Management (2007)
We previously developed a social mechanism for distributed reputation management, in which an agent combines testimonies from several witnesses to determine its ratings of another agent. However,...
James Pan, Minh-van Ngo, Christy Woo, Jung-suk Goo, Paul Besser, Bin Yu, ...
This letter describes a replacement (damascene) metal gate NMOSFET with TaSiN and PVD TaN as stacked gate electrode. The goal is to perform the “gate electrode engineering” in order to change...
Peter Bühlmann (ETH Zürich), Bin Yu
Jiang, Lugosi and Vayatis, and Zhang ought to be congratulated for their different works on the original AdaBoost algorithm with early stopping (Jiang), an ℓ1-penalized version of boosting (Lugosi and...
Searching Social Networks (2007)
A referral system is a multiagent system whose member agents are capable of giving and following referrals. The specific cases of interest arise where each agent has a user. The agents cooperate by...
We analyze a class of Monte Carlo estimators that are stochastically weighted averages of independent replications. The weights are chosen to constrain the weighted averages of auxiliary control...
Large scale inference and tomography for network monitoring and diagnosis. (2007)
Mark Coates, Alfred Hero, Robert Nowak, Bin Yu
Today's Internet is a massive, distributed network which continues to explode in size as ecommerce and related activities grow. The heterogeneous and largely unregulated structure of the...
This paper investigates a variant of boosting, L2Boost, which is constructed from a functional gradient descent algorithm with the L2-loss function. Based on an explicit stagewise refitting...
nm Gate Length Strained Silicon CMOS (2007)
Qi Xiang, Jung-suk Goo, Haihong Wang, Yayoi Takamura, Bin Yu, ...
We report on performance and scalability for strained Si CMOS devices with Lgate down to 25nm, 1.2nm nitrided oxide and NiSi. Good control of short channel effects was achieved. NiSi was found to be...
MISR Cloud Detection over Ice and Snow Based on Linear Correlation Matching (2007)
Tao Shi, Bin Yu, Amy Braverman
Cloud detection is a crucial step in any climate modelling or prediction. Multi-angle Imaging SpectroRadiometer (MISR) was launched in 1999 by NASA to provide 9 angle and 4 band data to retrieve or...
Internet-based interactive HDTV (2007)
Abstract. Traditional interactive TV systems depend on expensive hardware, proprietary formats, and a closed-loop end-to-end approach, which greatly limits scalability and extensibility of TV...
Coverage Adjusted Entropy Estimation ∗ (2007)
Vincent Q. Vu, Bin Yu, Robert E. Kass
Data on “neural coding” have frequently been analyzed using information-theoretic measures. These formulations involve the fundamental, and generally difficult statistical problem of estimating...
Grouped and hierarchical model selection through composite absolute penalties (2006)
Peng Zhao, Guilherme Rocha, Bin Yu
Extracting useful information from high-dimensional data is an important part of the focus of today’s statistical research and practice. Penalized loss function minimization has been shown to be...
Sean Owens, Paul Scerri, Robin Glinton, Bin Yu, Katia Sycara
To perform large-scale coordination in real-world environments requires that many individually complex technologies come together to form integrated solutions. We present an application where several...
A multi-stream adaptation framework for bandwidth management in 3D teleimmersion (2006)
Zhenyu Yang, Bin Yu, Klara Nahrstedt, Ruzena Bajscy
Tele-immersive environments will improve the state of collaboration among distributed participants. However, along with the promise, a new set of challenges has emerged, including the real-time...
Paul Scerri, Sean Owens, Robin Glinton, Bin Yu, Katia Sycara
To perform large-scale coordination in real-world environments requires that many individually complex technologies come together to form integrated solutions. In this paper, we present an...
On Model Selection Consistency of Lasso (2006)
Sparsity or parsimony of statistical models is crucial for their proper interpretations, as in sciences and social sciences. Model selection is a commonly used method to find such models, but usually...
Some Theory for Generalized Boosting Algorithms (2006)
Peter J. Bickel, Ya'acov Ritov, Alon Zakai, Bin Yu
We give a review of various aspects of boosting, clarifying the issues through a few simple results, and relate our work and that of others to the minimax paradigm of statistics. We consider the...
Peter Bühlmann, Bin Yu, Yoram Singer, Larry Wasserman
We propose Sparse Boosting (the SparseL2Boost algorithm), a variant on boosting with the squared error loss. SparseL2Boost yields sparser solutions than the previously proposed L2Boosting by...
Grouped and hierarchical model selection through composite absolute penalties (2006)
Peng Zhao, Guilherme Rocha, Bin Yu
Recently much attention has been devoted to model selection through regularization methods in regression and classification where features are selected by use of a penalty function (e.g. Lasso in...
Scalable and reliable data delivery in mobile ad hoc sensor networks (2006)
Bin Yu, Paul Scerri, Katia Sycara
This paper studies scalable data delivery algorithms in mobile ad hoc sensor networks with node and link failures. Many algorithms have been developed for data delivery and fusion in static...
On model selection consistency of lasso (2006)
Peng Zhao, Bin Yu, David Madigan
Sparsity or parsimony of statistical models is crucial for their proper interpretations, as in sciences and social sciences. Model selection is a commonly used method to find such models, but usually...
Collaborative Dancing in Tele-immersive Environment (2006)
Zhenyu Yang, Bin Yu, Wanmin Wu, Klara Nahrstedt
We first present the tele-immersive environments developed jointly by University of Illinois at Urbana-Champaign and University of California at Berkeley. The environment features 3D full and real...
Collaborative Dancing in Tele-immersive Environment (2006)
Zhenyu Yang, Bin Yu, Ross Diankov
We present a study of collaborative dancing between remote dancers in a tele-immersive environment which features 3D full and real body capturing, wide field of view, multi-display 3D rendering, and...
Yang, Zhiyong, Ebright, Yon W., Yu, Bin, Chen, Xuemei
microRNAs (miRNAs) and small interfering RNAs (siRNAs) in plants bear a methyl group on the ribose of the 3′ terminal nucleotide. We showed previously that the methylation of miRNAs and siRNAs...
Some theory for generalized boosting algorithms (2006)
Peter J. Bickel, Ya’acov Ritov, Alon Zakai, Bin Yu
We give a review of various aspects of boosting, clarifying the issues through a few simple results, and relate our work and that of others to the minimax paradigm of statistics. We consider the...
Hourglass Multimedia Content and Service Composition Framework for Smart Room Environments (2005)
Klara Nahrstedt, Bin Yu, Jin Liang, Yi Cui
Pervasive environments with their ubiquitous multimedia devices such as video-enabled cell-phones, laptops, PC tablets, digital cameras, HDTV plasma displays are becoming integral part of our...
An integrated token-based algorithm for scalable coordination (2005)
Yang Xu, Paul Scerri, Bin Yu, Steven Okamoto, Michael Lewis, Katia Sycara
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. However,...
Space Weather Modeling Framework: A new tool for the space science community (2005)
Gábor Tóth, Igor V. Sokolov, Tamas I. Gombosi, David R. Chesney, C. Robert Clauer, ...
This paper presents the design and implementation of the SWMF and some demonstrative tests. Future papers will describe validation (comparison of model results with measurements) and applications to...
An Integrated Token-Based Algorithm for Scalable Coordination (2005)
Yang Xu, Paul Scerri, Bin Yu, Steven Okamoto, Michael Lewis, ...
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. However, state...
Towards Flexible Coordination of Large Scale Multi-Agent Teams (2005)
Yang Xu, Elizabeth Liao, Paul Scerri, Bin Yu, Mike Lewis, Katia Sycara
In this paper, we address the limitations of existing models as they apply to very large agent teams. We develop algorithms aimed at flexible and efficient coordination, applying a decentralized social...
An Agent-Based C4ISR Testbed (2005)
Joseph Giampapa, Katia P. Sycara, Sean R, Young-woo Seo, Bin Yu
This paper describes six functional extensions to the OneSAF Testbed Baseline (OTB) modeling and simulation environment: (1) the addition of modules to access OTB's terrain database, (2) a...
A POMDP Approach to Token-Based Team Coordination (2005)
Yang Xu, Paul Scerri, Bin Yu, Michael Lewis, Katia Sycara
Efficient coordination among large numbers of heterogeneous agents promises to revolutionize the way in which some complex tasks, such as responding to urban disasters can be performed. Token-based...
An evidential model of multisensor decision fusion for force aggregation and classification (2005)
Bin Yu, Joseph Giampapa, Sean Owens, Katia Sycara
This paper describes airborne sensor networks for target detection and identification in military applications. One challenge is how to process and aggregate data from many sensor sources to generate...
A generalized discriminant analysis based on a new optimization criterion is presented. The criterion extends the optimization criteria of the classical Linear Discriminant Analysis (LDA) when the...
Bin Yu, Katia Sycara, Joseph Giampapa, Sean Owens
The paper describes airborne sensor networks for target tracking and identification in military applications. The raw information about targets from airborne sensors is uncertain and often noisy. One...
A dynamic pricing mechanism for p2p referral systems (2004)
Most existing research on peer-to-peer systems focuses on protocol design. In this paper, we consider the issue of free riding in peer-to-peer referral systems. Free riders are agents that refuse...
Simulation for American Options: Regression Now or Regression Later? (2004)
This article investigates connections between a weighted Monte Carlo technique and regression-based methods for this problem. The weighted Monte Carlo technique is shown to be equivalent to a...
Network tomography: recent developments (2004)
Rui Castro, Mark Coates, Gang Liang, Robert Nowak, Bin Yu
Today's Internet is a massive, distributed network which continues to explode in size as e-commerce and related activities grow. The heterogeneous and largely unregulated structure of the Internet...
Extending the ONESAF Testbed into a C4ISR Testbed (2004)
Joseph A. Giampapa, Katia Sycara, Sean Owens, Robin Glinton, Young-woo Seo, ...
This article describes how the modeling and simulation environment of the OneSAF Testbed Baseline (OTB) v1.0 has been extended to enable the testing of heterogeneous algorithms that are being...
Cloud Detection over Snow and Ice Using MISR Data (2004)
Tao Shi, Bin Yu, Eugene E. Clothiaux, Amy J. Braverman
Clouds play a major role in Earth's climate and cloud detection is a crucial step in the processing of satellite observations in support of radiation budget, numerical weather prediction and...
In this paper, we propose the Boosted Lasso (BLasso) algorithm that is able to produce an approximation to the complete regularization path for general Lasso problems. BLasso is derived as a coordinate...
Network tomography: recent developments (2004)
Rui Castro, Mark Coates, Gang Liang, Robert Nowak, Bin Yu
Abstract. Today’s Internet is a massive, distributed network which continues to explode in size as e-commerce and related activities grow. The heterogeneous and largely unregulated structure of the...
In this paper, we propose the Boosted Lasso (BLasso) algorithm that is able to produce an approximation to the complete regularization path for general Lasso problems. BLasso is derived as a...
Michael Quist, Golan Yona, Bin Yu
We present a novel approach for embedding general metric and nonmetric spaces into low-dimensional Euclidean spaces. As opposed to traditional multidimensional scaling techniques, which minimize the...
Extending the OneSAF Testbed into a C4ISR Test Bed (2004)
Joseph A. Giampapa, Katia Sycara, Sean Owens, Robin Glinton, Young-woo Seo, Bin Yu, ...
This paper describes how the modeling and simulation environment of the OneSAF Testbed Baseline (OTB) v1.0 has been extended to enable the testing of heterogeneous algorithms that are being designed...
Lossless Online Bayesian Bagging (2004)
Bagging frequently improves the predictive performance of a model. An online version has recently been introduced, which attempts to gain the benefits of an online algorithm while approximating...
Yu, Wei, Pirollo, Kathleen F., Yu, Bin, Rait, Antonina, Xiang, Laiman, Huang, Weiqun, ...
Successful cancer gene therapy depends on the development of non-toxic, efficient, tumor cell-specific systemic gene delivery systems. Our laboratory has developed a systemically...
Yu, Bin, Wakao, Setsuko, Fan, Jilian, Benning, Christoph
Phosphatidic acid is a key intermediate for chloroplast membrane lipid biosynthesis. De novo phosphatidic acid biosynthesis in plants occurs in two steps: first the acylation of the sn-1 position of...
Incentive Mechanisms for Peer-to-Peer Systems (2003)
Abstract. Most of the existing research in peer-to-peer systems focuses on protocol design and doesn’t consider the rationality of each peer. One phenomenon that should not be ignored is free...
Boosting with early stopping: convergence and consistency (2003)
Boosting is one of the most significant advances in machine learning for classification and regression. In its original and computationally flexible version, boosting seeks to minimize...
Boosting with early stopping: convergence and consistency (2003)
Boosting is one of the most significant advances in machine learning for classification and regression. In its original and computationally flexible version, boosting seeks to minimize empirically a...
Large Sample Properties of Weighted Monte Carlo Estimators (2003)
A general approach to improving simulation accuracy uses information about auxiliary control variables with known expected values to improve the estimation of unknown quantities. We analyze weighted...
An adaptive social network for information access: Theoretical and experimental results (2003)
Bin Yu, Mahadevan Venkatraman, Munindar P. Singh
We consider a social network of software agents who assist each other in helping their users find information. Unlike in most previous approaches, our architecture is fully distributed and includes...
Video Summarization Based On User Log Enhanced Link Analysis (2003)
Efficient video data management calls for intelligent video summarization tools that automatically generate concise video summaries for fast skimming and browsing. Traditional video summarization...
Simultaneous Gene Clustering and Subset Selection for Sample Classification (2003)
Rebecka Jornsten, Bin Yu
Motivation: The microarray technology allows for the simultaneous monitoring of thousands of gene expression for each sample. The high-dimensional gene expression data can be used to study...
Searching Social Networks (2003)
A referral system is a multiagent system whose member agents are capable of giving and following referrals. The specific cases of interest arise where each agent has a user. The agents cooperate by...
Boosting with Early Stopping: Convergence and Consistency (2003)
Tong Zhang, Bin Yu
Boosting is one of the most significant advances in machine learning for classification and regression. In its original and computationally flexible version, boosting seeks to minimize empirically a...
Comparing Bayes model averaging and stacking when model approximation error cannot be ignored (2003)
We compare Bayes Model Averaging, BMA, to a non-Bayes form of model averaging called stacking. In stacking, the weights are no longer posterior probabilities of models; they are obtained by a...
Number of paths versus number of basis functions in American option pricing (2003)
An American option grants the holder the right to select the time at which to exercise the option, so pricing an American option entails solving an optimal stopping problem. Difficulties in applying...
The test set error is smallest at our largest tree, but we cannot make the trees larger (more complex) to potentially decrease the test set error (we could enlarge them a bit by requiring at least one...
An Adaptive Social Network for Information Access: Theoretical and Experimental Results (2003)
Bin Yu, Mahadevan Venkatraman, Munindar P. Singh
We consider a social network of software agents who assist each other in helping their users find information. Unlike in most previous approaches, our architecture is fully distributed and includes...
Boosting with Early Stopping: Convergence and Consistency (2003)
Tong Zhang, Bin Yu
Boosting is one of the most significant advances in machine learning for classification and regression. In its original and computationally flexible version, boosting seeks to minimize empirically a...
Detecting Deception in Reputation Management (2003)
We previously developed a social mechanism for distributed reputation management, in which an agent combines testimonies from several witnesses to determine its ratings of another agent. However,...
Number of Paths Versus Number of Basis Functions in American Option Pricing (2003)
An American option grants the holder the right to select the time at which to exercise the option, so pricing an American option entails solving an optimal stopping problem.
Strained Silicon NMOS with Nickel-Silicide Metal Gate (2003)
Qi Xiang, Jung-suk Goo, James Pan, Bin Yu, Shibly Ahmed, ...
Strained Si NMOS transistors with Lgate down to 35nm were fabricated using NiSi as a metal gate electrode material for the first time. Compared to poly gate devices, NiSi metal gate devices showed...
Comparing Bayes Model Averaging and Stacking When Model Approximation Error Cannot be Ignored (2003)
We compare Bayes Model Averaging, BMA, to a non-Bayes form of model averaging called stacking. In stacking, the weights are no longer posterior probabilities of models; they are obtained by a...
Simultaneous gene clustering and subset selection for sample classification via MDL (2003)
Motivation: The microarray technology allows for the simultaneous monitoring of thousands of genes for each sample. The high-dimensional gene expression data can be used to study similarities of gene...
Emergence of Agent-based Referral Networks (2002)
We consider the problem of finding desirable services and sources of information in distributed multiagent systems. Traditional solutions, such as centralized directories, may be inaccurate and may...
Multiterminal estimation - extensions and geometric interpretation (2002)
In multiterminal estimation the basic theoretical question is to prove the existence of encoding and decoding schemes that can achieve a certain rate of compression, while resulting in a particular...
FinFET scaling to 10 nm gate length (2002)
Bin Yu, Shibly Ahmed, Haihong Wang, Scott Bell, Chih-yuh Yang, Cyrus Tabery, ...
While the selection of new “backbone” device structure in the era of post-planar CMOS is open to a few candidates, FinFET and its variants show great potential in scalability and...
Bagging is one of the most effective computationally intensive procedures to improve on unstable estimators or classifiers, useful especially for high dimensional data set problems. Here we formalize...
Minimum Description Length Model Selection Criteria for Generalized Linear Models (2002)
This paper derives several model selection criteria for generalized linear models (GLMs) following the principle of Minimum Description Length (MDL). We focus our attention on the mixture form of...
Mark Coates, Alfred Hero, Robert Nowak, Bin Yu
Today's Internet is a massive, distributed network which continues to explode in size as ecommerce and related activities grow. The heterogeneous and largely unregulated structure of the...
Search in Referral Networks (2002)
Bin Yu, Munindar P. Singh
Referral systems have been proposed to assist people in finding potential experts in person-to-person social networks, in which each user is assigned a software agent and software agents help...
Boosting with the L_2-Loss: Regression and Classification (2002)
This paper investigates a computationally simple variant of boosting, L_2Boost, which is constructed from a functional gradient descent algorithm with the L_2-loss function. As other boosting...
Perceptual Audio Coding using Adaptive Pre- and Post-Filters and Lossless Compression (2002)
Gerald Schuller, Bin Yu, Dawei Huang, Bernd Edler
This paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of source signals...
Distributed Reputation Management For Electronic Commerce (2002)
This paper considers the problem of automatically collecting ratings about a given party from others. Our approach involves a distributed agent architecture and adapts the mathematical theory of...
Multiterminal Estimation - Extensions and Geometric Interpretation (2002)
Rebecka Jornsten, Bin Yu
In multiterminal estimation the basic theoretical question is to prove the existence of encoding and decoding schemes that can achieve a certain rate of compression, while resulting in a particular...
An Evidential Model of Distributed Reputation Management (2002)
For agents to function effectively in large and open networks, they must ensure that their correspondents, i.e., the agents they interact with, are trustworthy. Since no central authorities may...
Maximum Pseudo Likelihood Estimation in Network Tomography (2002)
Network monitoring and diagnosis are key to improving network performance. The difficulties of performance monitoring lie in today's fast growing Internet, accompanied by increasingly...
Lossless Coding of Audio Signals Using Cascaded Prediction (2001)
Gerald Schuller, Bin Yu, Dawei Huang
A novel predictive lossless coding scheme is proposed. The prediction is based on a new weighted cascaded least mean squared (WCLMS) method. WCLMS is especially designed for music/speech signals. It...
Peter Bühlmann, Bin Yu
Bagging is one of the most effective computationally intensive procedures to improve on unstable estimators or classifiers, useful especially for high dimensional data set problems. Here we formalize...
Large scale inference and tomography for network monitoring and diagnosis. (2001)
Mark Coates, Alfred Hero, Robert Nowak, Bin Yu
Today's Internet is a massive, distributed network which continues to explode in size as ecommerce and related activities grow. The heterogeneous and largely unregulated structure of the...
Bagging is one of the most effective computationally intensive procedures to improve on unstable estimators or classifiers, useful especially for high dimensional data set problems. Here we formalize...
Adaptive wavelet thresholding for image denoising and compression (2000)
S. Grace Chang, Bin Yu, Martin Vetterli
The first part of this paper proposes an adaptive, data-driven threshold for image denoising via wavelet soft-thresholding. The threshold is derived in a Bayesian framework, and the prior...
Wavelet Thresholding for Multiple Noisy Image Copies (2000)
S. Grace Chang, Bin Yu, Martin Vetterli
This correspondence addresses the recovery of an image from its multiple noisy copies. The standard method is to compute the weighted average of these copies. Since the wavelet...
Trust and reputation management in a small-world network (2000)
Mahadevan Venkatraman, Bin Yu, Munindar P. Singh
Successful commerce relies heavily upon the reputations that the different parties acquire through their dealings with each other. We view an e-commerce community as a social network, which supports...
Wavelet thresholding via mdl for natural images (2000)
We study the application of Rissanen's Principle of Minimum Description Length (MDL) to the problem of wavelet denoising and compression for natural images. After making a connection between...
A social mechanism of reputation management in electronic communities (2000)
Trust is important wherever agents must interact. We consider the important case of interactions in electronic communities, where the agents assist and represent...
A scalable method for estimating network traffic matrices (2000)
Jin Cao, Scott Vander Wiel, Bin Yu, Zhengyuan Zhu
Traffic matrices are extremely useful for network configuration, management, engineering, and pricing. Direct measurement is, however, expensive in general and impossible in some cases....
Bin Yu, Mahadevan Venkatraman, Munindar P. Singh
Conventional techniques for finding information on the web are time-consuming and often yield unreliable results. This has led to renewed interest in referral systems, which support the interactions...
Wavelet Thresholding for Multiple Noisy Image Copies (2000)
S. Grace Chang, Bin Yu, Martin Vetterli
This work addresses the recovery of an image from its multiple noisy copies. The standard method is to compute the weighted average of these copies. Since the wavelet thresholding technique has been...
Wavelet Thresholding via MDL for Natural Images (2000)
We study the application of Rissanen's Principle of Minimum Description Length (MDL) to the problem of wavelet denoising and compression for natural images. After making a connection between...
Iterated Logarithmic Expansions of the Pathwise Code Lengths for Exponential Families (2000)
Rissanen's Minimum Description Length (MDL) principle is a statistical modeling principle motivated by coding theory. For exponential families we obtain pathwise expansions, to the constant...
Bagging is one of the most effective computationally intensive procedures to improve on unstable estimators or classifiers, useful especially for high dimensional data set problems. Here we formalize the...
A Social Mechanism of Reputation Management in Electronic Communities (2000)
Bin Yu, Munindar P. Singh
Trust is important wherever agents must interact. We consider the important case of interactions in electronic communities, where the agents assist and represent principal entities, such as people...
Bagging is one of the most effective computationally intensive procedures to improve on unstable estimators or classifiers, useful especially for high dimensional data set problems. Here we formalize the...
"Comprestimation": Microarray Images in Abundance. (2000)
Microarray analysis is a powerful technique for the classification and identification of gene expressions. Recently significant progress has been made in the development of microarray instruments,...
Bin Yu, Mahadevan Venkatraman, Munindar P. Singh
Conventional techniques for finding information on the web are time-consuming and often yield unreliable results. This has led to renewed interest in referral systems, which support the interactions...
What Makes Boosting Work in Classification? (2000)
... sensible surrogates because the likelihood equations are intractable. Despite these similarities, there are fundamental differences between boosting and these statistical procedures. As recounted by...
What Makes Boosting Work in Classification? (2000)
Peter Bühlmann, Bin Yu
ons as sensible surrogates because the likelihood equations are intractable. Despite these similarities, there are fundamental differences between boosting and these statistical procedures. As...
Time-Varying Network Tomography: Router Link Data (2000)
Jin Cao, Drew Davis, Scott Vander Wiel, Bin Yu
The origin-destination (OD) traffic matrix of a computer network is useful for solving problems in design, routing, configuration debugging, monitoring, and pricing. Directly measuring this matrix is...
A Social Mechanism of Reputation Management in Electronic Communities (2000)
Trust is important wherever agents must interact. We consider the important case of interactions in electronic communities, where the agents assist and represent principal entities, such as people...
Trust and Reputation Management in a Small-World Network (2000)
Mahadevan Venkatraman, Bin Yu, Munindar P. Singh
Successful commerce relies heavily upon the reputations that the different parties acquire through their dealings with each other. We view an e-commerce community as a social network, which supports...
Spatially Adaptive Wavelet Thresholding with Context Modeling for Image Denoising (2000)
S. Grace Chang, Bin Yu, Martin Vetterli
The method of wavelet thresholding for removing noise, or denoising, has been researched extensively due to its effectiveness and simplicity. Much of the literature has focused on developing the best...
Umeshita-Suyama, Ritsuko, Sugimoto, Rie, Akaiwa, Mina, Arima, Kazuhiko, Yu, Bin, Wada, Morimasa, ...
IL-4 and IL-13 are pleiotropic cytokines whose biological activities overlap with each other. IL-13 receptor α chain 1 (IL-13Rα1) is necessary for binding to IL-13, and the heterodimer composed of...
Iterated Logarithmic Expansions of the Pathwise Code Lengths for Exponential Families (1999)
Rissanen's Minimum Description Length (MDL) principle is a statistical modeling principle motivated by coding theory. For exponential families we obtain pathwise expansions, to the constant...
A Multiagent Referral System for Expertise Location (1999)
Bin Yu, Mahadevan Venkatraman, Munindar P. Singh
A referral system is an agent-based framework for assisting and simplifying person-to-person communication, such as finding experts for a specified topic. The informal person-to-person social...
Wavelet Thresholding via MDL: Simultaneous Denoising and Compression (1999)
In the context of wavelet denoising and compression, we study minimum description length (MDL) criteria for model selection criteria as flexible forms of thresholding. Mixture MDL methods based on a...
Wavelet Thresholding via MDL: Simultaneous Denoising and Compression (1999)
Mark Hansen, Bin Yu
In the context of wavelet denoising and compression, we study minimum description length (MDL) criteria for model selection criteria as flexible forms of thresholding. Mixture MDL methods based on a...
Wavelet Thresholding via MDL: Simultaneous Denoising and Compression (1999)
Mark Hansen, Bin Yu
In the context of wavelet denoising and compression, we study minimum description length (MDL) criteria for model selection criteria as flexible forms of thresholding. Mixture MDL methods based on a...
Penalized discriminant analysis of in situ hyperspectral data for conifer species recognition (1999)
Bin Yu, I. Michael Ostland, Peng Gong, Ruiliang Pu
Abstract—Using in situ hyperspectral measurements collected
Spatially Adaptive Wavelet Thresholding with Context Modeling for Image Denoising (1998)
S. Grace Chang, Bin Yu, Martin Vetterli
The method of wavelet thresholding for removing noise, or denoising, has been researched extensively due to its effectiveness and simplicity. Much of the work has been concentrated on finding the...
Model Selection and the Principle of Minimum Description Length (1998)
This paper reviews the principle of Minimum Description Length (MDL) for problems of model selection. By viewing statistical modeling as a means of generating descriptions of observed data, the MDL...
Spatially Adaptive Wavelet Thresholding with Context Modeling for Image Denoising (1998)
S. Grace Chang, Bin Yu, Martin Vetterli
The method of wavelet thresholding for removing noise, or denoising, has been researched extensively due to its effectiveness and simplicity. Much of the literature has focused on developing the best...
Bridging Compression To Wavelet Thresholding As A Denoising Method (1997)
S. Grace Chang, Bin Yu, Martin Vetterli
Some past work has suggested that lossy compression can be a good denoising tool. Building on this theme, we make the connection that quantization of transform coefficients approximates the operation...
Image Denoising Via Lossy Compression And Wavelet Thresholding (1997)
Grace Chang, Bin Yu, Martin Vetterli
Some past work has proposed to use lossy compression to remove noise, based on the rationale that a reasonable compression method retains the dominant signal features more than the randomness of the...
Image Subband Coding Using Progressive Classification and Adaptive Quantization (1997)
Antonio Ortega, Bin Yu, Youngjun Yoo
Adaptive compression methods have been a key component of many of the recently proposed subband (or wavelet) image coding techniques. This paper deals with a particular type of adaptive subband image...
Adaptive quantization of image subbands with efficient overhead rate selection (1996)
Youngjun Yoo, Antonio Ortega, Bin Yu
Recent subband image coding techniques owe much of their success to an effective use of adaptive quantization and adaptive entropy coding. It is often the case that adaptive...
A Statistical Analysis of Adaptive Scalar Quantization based on Quantized Past Data (1995)
In this paper, we investigate from several angles the adaptive quantization problem based on quantized causal past data (cf. Ortega and Vetterli, 1994). First, when stationarity and ergodicity are...
Regeneration in Markov Chain Samplers (1994)
Per Mykland, Luke Tierney, Bin Yu
Markov chain sampling has received considerable attention in the recent literature, in particular in the context of Bayesian computation and maximum likelihood estimation. This paper discusses the...
Regeneration in Markov Chain Samplers (1994)
Per Mykland, Luke Tierney, Bin Yu
Markov chain sampling has recently received considerable attention in particular in the context of Bayesian computation and maximum likelihood estimation. This paper discusses the use of Markov chain...
Looking at Markov Samplers through Cusum Path Plots: a simple diagnostic idea (1994)
In this paper, we propose to monitor a Markov chain sampler using the cusum path plot of a chosen 1-dimensional summary statistic. We argue that the cusum path plot can bring out, more effectively...
Looking at Markov samplers through Cusum path plots: a simple diagnostic idea (1994)
In this paper, we propose to monitor a Markov chain sampler using the cusum path plot of a chosen 1-dimensional summary statistic. We argue that the cusum path plot can bring out, more effectively than...
Jin Liang, Bin Yu, Zhenyu Yang, Klara Nahrstedt
IPTV, or Internet-Based TV broadcasting is going to significantly change the way people obtain information, get entertainment, and possibly collaborate with each other. Compared with today’s TV...
Computational prediction of novel non-coding RNAs in Arabidopsis thaliana
Song, Dandan, Yang, Yang, Yu, Bin, Zheng, Binglian, Deng, Zhidong, Lu, Bao-Liang, ...
Adaptive Wavelet Thresholding for Image Denoising and Compression
S. Grace Chang, Bin Yu, Martin Vetterli
The first part of this paper proposes an adaptive, data-driven threshold for image denoising via wavelet soft-thresholding. The threshold is derived in a Bayesian framework, and the prior used on the...
Cheng, Pingyan, Corzo, Cesar A., Luetteke, Noreen, Yu, Bin, Nagaraj, Srinivas, Bui, Marylin M., ...
Accumulation of myeloid-derived suppressor cells (MDSCs) associated with inhibition of dendritic cell (DC) differentiation is one of the major immunological abnormalities in cancer and leads to...