Main | Browse | Search | Author Links | Manage ETD List | Review ETDs | Catalog ETDs | Help
 

Title page for ETD etd-07082009-100035


Type of Document Dissertation
Author Cieslak, David Alan
Author's Email Address dcieslak@nd.edu
URN etd-07082009-100035
Title Finding Problems In, Proposing Solutions to, and Performing Analysis on Imbalanced Data
Degree Doctor of Philosophy
Department Computer Science and Engineering
Advisory Committee
Advisor Name Title
Nitesh Chawla Committee Chair
Aaron Striegel Committee Member
Kevin Bowyer Committee Member
Pat Flynn Committee Member
Keywords
  • Machine Learning
  • Data Mining
Date of Defense 2009-06-10
Availability unrestricted
Abstract
The data mining and machine learning research communities have focused on developing specialized

algorithms and methods to handle a multitude of potential complications which may confront the traditional supervised learning task. Of these, class imbalance is among the most persistent in real-world applications. Less attention has been garnered for the set of problems in which the data distribution changes, potentially wiping out the gains from expensive data mining methods. To an even lesser degree has the combination of problems been considered. It is the purpose of this dissertation to explore concepts of distributional

change, particularly within the context of imbalanced data problems and the effects of the performance on solutions from this realm. Based on this exploration, the proposed dissertation will derive methods to identify and handle both problems simultaneously.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  CieslakD072009.pdf 3.40 Mb 00:15:44 00:08:05 00:07:05 00:03:32 00:00:18

Browse All Available ETDs by ( Author | Department )

If you have more questions or technical problems, please Contact the Graduate School.