Results 11 - 20
of
1,680
Clustering Massive Datasets
, 1998
"... Clustering data is not an easy problem in general, and is compounded for a massive dataset. Restricting attention to a sample from the data ignores minority groups and hence compromises on the available riches. This paper develops, under Gaussian assumptions, a multistage sequential algorithm. After ..."
Abstract
- Add to MetaCart
Clustering data is not an easy problem in general, and is compounded for a massive dataset. Restricting attention to a sample from the data ignores minority groups and hence compromises on the available riches. This paper develops, under Gaussian assumptions, a multistage sequential algorithm
Economic Growth and Subjective WellBeing: Reassessing the
- Easterlin Paradox.” IZA Discussion Paper 3654, Institute for the Study of Labor
, 2008
"... www.nber.org/~jwolfers The “Easterlin Paradox ” suggests that there is no link between the level of economic development of a society and average levels of happiness. We return to Easterlin’s question: “Will raising the incomes of all increase the happiness of all? ” and analyze multiple rich datase ..."
Abstract
-
Cited by 159 (0 self)
- Add to MetaCart
www.nber.org/~jwolfers The “Easterlin Paradox ” suggests that there is no link between the level of economic development of a society and average levels of happiness. We return to Easterlin’s question: “Will raising the incomes of all increase the happiness of all? ” and analyze multiple rich
Automatically Annotating and Integrating Spatial Datasets
- In Proceedings of the International Symposium on Spatial and Temporal Databases, Santorini Island
, 2003
"... Recent growth of the geo-spatial information on the web has made it possible to easily access a wide variety of spatial data. By integrating these spatial datasets, one can support a rich set of queries that could not have been answered given any of these sets in isolation. However, accurately in ..."
Abstract
-
Cited by 20 (9 self)
- Add to MetaCart
Recent growth of the geo-spatial information on the web has made it possible to easily access a wide variety of spatial data. By integrating these spatial datasets, one can support a rich set of queries that could not have been answered given any of these sets in isolation. However, accurately
Pedestrian detection: A benchmark
- In CVPR
, 2009
"... Pedestrian detection is a key problem in computer vision, with several applications including robotics, surveillance and automotive safety. Much of the progress of the past few years has been driven by the availability of challenging public datasets. To continue the rapid rate of innovation, we intr ..."
Abstract
-
Cited by 140 (11 self)
- Add to MetaCart
introduce the Caltech Pedestrian Dataset, which is two orders of magnitude larger than existing datasets. The dataset contains richly annotated video, recorded from a moving vehicle, with challenging images of low resolution and frequently occluded people. We propose improved evaluation metrics
Forced to be Rich? Returns to Compulsory Schooling
- in Britain”, IZA Discussion Paper #3305
, 2008
"... Do students benefit from compulsory schooling? In an important article, Oreopoulos (2006) studied the 1947 British compulsory schooling law change and found large returns to schooling of about 15 % using the General Household Survey (GHS). We reanalyse this dataset and find much smaller returns of a ..."
Abstract
-
Cited by 43 (6 self)
- Add to MetaCart
Do students benefit from compulsory schooling? In an important article, Oreopoulos (2006) studied the 1947 British compulsory schooling law change and found large returns to schooling of about 15 % using the General Household Survey (GHS). We reanalyse this dataset and find much smaller returns
Compositional mining of multirelational biological datasets
- In ACM Trans. Knowl. Discov. Data
, 2008
"... High-throughput biological screens are yielding ever-growing streams of information about multiple aspects of cellular activity. As more and more categories of datasets come online, there is a corresponding multitude of ways in which inferences can be chained across them, motivating the need for com ..."
Abstract
-
Cited by 7 (5 self)
- Add to MetaCart
High-throughput biological screens are yielding ever-growing streams of information about multiple aspects of cellular activity. As more and more categories of datasets come online, there is a corresponding multitude of ways in which inferences can be chained across them, motivating the need
The multidimensional wisdom of crowds
- In In Proc. of NIPS
, 2010
"... Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important method for annotating large datasets. We present a method for estimating the underlying value (e.g. the class) of each image from (noisy) annotations provided by multiple annotators. Our method is base ..."
Abstract
-
Cited by 147 (6 self)
- Add to MetaCart
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important method for annotating large datasets. We present a method for estimating the underlying value (e.g. the class) of each image from (noisy) annotations provided by multiple annotators. Our method
Eigenbehaviors: Identifying Structure in Routine
- IN PROC. OF UBICOMP’06
, 2006
"... In this work we identify the structure inherent in daily human behavior with models that can accurately analyze, predict and cluster multimodal data from individuals and groups. We represent this structure by the principal components of the complete behavioral dataset, a set of characteristic vecto ..."
Abstract
-
Cited by 144 (7 self)
- Add to MetaCart
In this work we identify the structure inherent in daily human behavior with models that can accurately analyze, predict and cluster multimodal data from individuals and groups. We represent this structure by the principal components of the complete behavioral dataset, a set of characteristic
From the cradle to the labor market? the effect of birth weight on adult outcomes.
- The Quarterly Journal of Economics,
, 2007
"... Abstract Lower birth weight babies have worse outcomes, both short-run in terms of oneyear mortality rates and longer run in terms of educational attainment and earnings. However, recent research has called into question whether birth weight itself is important or whether it simply reflects other h ..."
Abstract
-
Cited by 140 (11 self)
- Add to MetaCart
hard-to-measure characteristics. By applying within twin techniques using an unusually rich dataset from Norway, we examine the effects of birth weight on both short-run and long-run outcomes for the same cohorts. We find that birth weight does matter; despite short-run twin fixed effects estimates
A Hybrid Discriminative/Generative Approach for Modeling Human Activities
- 19th International Joint Conference on Artificial Intelligence (IJCAI
, 2005
"... Accurate recognition and tracking of human activities is an important goal of ubiquitous computing. Recent advances in the development of multi-modal wearable sensors enable us to gather rich datasets of human activities. However, the problem of automatically identifying the most useful features for ..."
Abstract
-
Cited by 138 (16 self)
- Add to MetaCart
Accurate recognition and tracking of human activities is an important goal of ubiquitous computing. Recent advances in the development of multi-modal wearable sensors enable us to gather rich datasets of human activities. However, the problem of automatically identifying the most useful features
Results 11 - 20
of
1,680