Skip to main navigation Skip to search Skip to main content

Fast nonparametric conditional density estimation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Conditional density estimation generalizes regression by modeling a full density f(y|x) rather than only the expected value E(y|x). This is important for many tasks, including handling multi-modality and generating prediction intervals. Though fundamental and widely applicable, nonparametric conditional density estimators have received relatively little attention from statisticians and little or none from the machine learning community. None of that work has been applied to greater than bivariate data, presumably due to the computational difficulty of data-driven bandwidth selection. We describe the double kernel conditional density estimator and derive fast dual-tree-based algorithms for bandwidth selection using a maximum likelihood criterion. These techniques give speedups of up to 3.8 million in our experiments, and enable the first applications to previously intractable large multivariate datasets, including a redshift prediction problem from the Sloan Digital Sky Survey.

Original languageEnglish (US)
Title of host publicationProceedings of the 23rd Conference on Uncertainty in Artificial Intelligence, UAI 2007
Pages175-182
Number of pages8
StatePublished - 2007
Externally publishedYes
Event23rd Conference on Uncertainty in Artificial Intelligence, UAI 2007 - Vancouver, BC, Canada
Duration: Jul 19 2007Jul 22 2007

Publication series

NameProceedings of the 23rd Conference on Uncertainty in Artificial Intelligence, UAI 2007

Conference

Conference23rd Conference on Uncertainty in Artificial Intelligence, UAI 2007
Country/TerritoryCanada
CityVancouver, BC
Period7/19/077/22/07

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Fast nonparametric conditional density estimation'. Together they form a unique fingerprint.

Cite this