Understanding and Adapting Tree Ensembles: A Training Data Perspective

dc.contributor.advisorLowd, Daniel
dc.contributor.authorBrophy, Jonathan
dc.date.accessioned2023-03-24T18:55:52Z
dc.date.available2023-03-24T18:55:52Z
dc.date.issued2023-03-24
dc.description.abstractDespite the impressive success of deep-learning models on unstructured data (e.g., images, audio, text), tree-based ensembles such as random forests and gradient-boosted trees are hugely popular and remain the preferred choice for tabular or structured data, and are regularly used to win challenges on data-competition websites such as Kaggle and DrivenData. Despite their impressive predictive performance, tree-based ensembles lack certain characteristics which may limit their further adoption, especially for safety-critical or privacy-sensitive domains such as weather forecasting or predictive medical modeling. This dissertation investigates the shortcomings currently facing tree-based ensembles---lack of explainable predictions, limited uncertainty estimation, and inefficient adaptability to changes in the training data---and posits that numerous improvements to tree-based ensembles can be made by analyzing the relationships between the training data and the resulting learned model. By studying the effects of one or many training examples on tree-based ensembles, we develop solutions for these models which (1) increase their predictive explainability, (2) provide accurate uncertainty estimates for individual predictions, and (3) efficiently adapt learned models to accurately reflect updated training data. This dissertation includes previously published coauthored material.en_US
dc.identifier.urihttps://hdl.handle.net/1794/28085
dc.language.isoen_US
dc.publisherUniversity of Oregon
dc.rightsAll Rights Reserved.
dc.subjectgradient-boosted treesen_US
dc.subjectinfluence estimationen_US
dc.subjectmachine unlearningen_US
dc.subjectrandom forestsen_US
dc.subjecttree ensemblesen_US
dc.subjectuncertainty estimationen_US
dc.titleUnderstanding and Adapting Tree Ensembles: A Training Data Perspective
dc.typeElectronic Thesis or Dissertation
thesis.degree.disciplineDepartment of Computer and Information Science
thesis.degree.grantorUniversity of Oregon
thesis.degree.leveldoctoral
thesis.degree.namePh.D.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Brophy_oregon_0171A_13478.pdf
Size:
11.81 MB
Format:
Adobe Portable Document Format