National Changhua University of Education Institutional Repository : Item 987654321/8862
English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 6507/11669
造訪人次 : 29922449      線上人數 : 407
RC Version 3.2 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 進階搜尋

請使用永久網址來引用或連結此文件: http://ir.ncue.edu.tw/ir/handle/987654321/8862

題名: A Multivariate Decision Tree Algorithm to Mine Imbalanced Data
作者: Tsai, Cheng-Jung;Lee, Chien-I Lee;Chen, Chiu-Ting;Yang, Wei-Pang
貢獻者: 數學系
關鍵詞: Classification;Data mining;Decision tree;Imbalance;Multivate test
日期: 2007-01
上傳時間: 2011-05-10T06:29:32Z
出版者: World Scientific and Engineering Academy and Society (WSEAS)
摘要: The class imbalance problem is an important issue in classification of Data mining. Among the proposed approaches, some of them modify the class distribution of the original data which would worsen the computational burden or might throw away some userful information; some are limited to specific dataset or only applicable to the dataset with numeric attribute; some would take a lot of training time due to the natural property of core techniques such as neural network; and some suffer from determining a proper threshold while the user is unfamiliar with the domain knowledge. In this paper, we proposed the HIerarchical Shrinking decision Tree (HIS-Tree) algorithm to solve these problems. HIS-Tree uses the multivariae test derived from geometric mean measurement as splitting criteria to group minority examples together. By this way, HIS-Tree can avoid discovering rules dominated by the majority examples. Finally, as shown in the experiment, HIS-Tree can predict minority/interesting examples more accurately.
關聯: WSEAS Transactions on Information Science and applications, 4(1):50-58
顯示於類別:[數學系] 期刊論文

文件中的檔案:

沒有與此文件相關的檔案.



在NCUEIR中所有的資料項目都受到原著作權保護.

 


DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回饋