
CLOHUI: AN EFFICIENT ALGORITHM FOR MINING CLOSED+ HIGH UTILITY ITEMSETS FROM TRANSACTION DATABASES


Author :  Shiming Guo

Affiliation :  School of Computer Science, Harbin Institute of Technology

Country :  China

Category :  Computer Science & Information Technology

Volume, Issue, Month, Year :  Vol 8, No 5, October, 2016

Abstract :


High-utility itemset mining (HUIM) is an important research topic in data mining, and many algorithms have been proposed for it. However, existing HUIM methods return too many high-utility itemsets (HUIs), which reduces both the efficiency and the effectiveness of mining, since users have to sift through a large number of HUIs to find the useful ones. Recently a new representation, the closed+ high-utility itemset (CHUI), has been proposed. With this concept, the number of HUIs is reduced massively. Existing methods discover CHUIs from a transaction database in two phases. In phase I, each itemset is first checked for closedness. If the itemset is closed, an overestimation technique is used to compute an upper bound on its utility in the database. Itemsets whose overestimated utilities are no less than a given threshold are selected as candidate CHUIs. In phase II, the candidate CHUIs generated in phase I are verified by computing their exact utilities in the database. These methods have two problems: 1) the number of candidate CHUIs is usually very large, and extensive memory is required; 2) the method used to compute closed itemsets is time-consuming. In this paper we therefore propose CloHUI, an efficient algorithm for mining CHUIs from a transaction database. CloHUI does not generate any candidate CHUIs during the mining process, and it verifies closed itemsets using a tree structure. We also propose a strategy to speed up the verification process. Extensive experiments on sparse and dense datasets compare CloHUI with the state-of-the-art algorithm CHUD. The results show that on dense datasets CloHUI significantly outperforms CHUD: it is more than an order of magnitude faster and consumes less memory.
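The two-phase scheme that the abstract attributes to existing methods can be illustrated with a small brute-force sketch. This is neither CloHUI nor CHUD, and it makes no attempt at efficiency. The toy database, the profit table, and all function names are hypothetical. Transaction-weighted utility (TWU) is assumed as the overestimate, since the abstract does not say which upper bound the existing methods use.

```python
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchased quantity.
DB = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "b": 1},
    {"b": 3, "c": 2},
    {"a": 1, "b": 1, "c": 1},
]
PROFIT = {"a": 5, "b": 2, "c": 1}  # hypothetical unit profit per item


def covering(itemset, db):
    """Transactions that contain every item of the itemset."""
    return [t for t in db if itemset.issubset(t)]


def is_closed(itemset, db):
    """An itemset is closed if no proper superset has the same support,
    i.e. it equals the intersection of all transactions containing it."""
    rows = covering(itemset, db)
    if not rows:
        return False
    closure = set.intersection(*(set(t) for t in rows))
    return closure == set(itemset)


def transaction_utility(t):
    """Total utility of one transaction."""
    return sum(qty * PROFIT[item] for item, qty in t.items())


def twu(itemset, db):
    """Assumed overestimate: sum of the utilities of covering transactions."""
    return sum(transaction_utility(t) for t in covering(itemset, db))


def utility(itemset, db):
    """Exact utility: only the itemset's own items are counted."""
    return sum(t[i] * PROFIT[i] for t in covering(itemset, db) for i in itemset)


def two_phase_chui(db, min_util):
    items = sorted({i for t in db for i in t})
    # Phase I: keep closed itemsets whose overestimated utility passes.
    candidates = []
    for k in range(1, len(items) + 1):
        for combo in combinations(items, k):
            x = frozenset(combo)
            if is_closed(x, db) and twu(x, db) >= min_util:
                candidates.append(x)
    # Phase II: verify each candidate against its exact utility.
    verified = {}
    for x in candidates:
        u = utility(x, db)
        if u >= min_util:
            verified[x] = u
    return candidates, verified


candidates, verified = two_phase_chui(DB, min_util=20)
print(len(candidates), "candidates;", len(verified), "verified")
```

On this toy input with a threshold of 20, phase I keeps three candidates ({b}, {a, b}, {b, c}) and phase II confirms only {a, b}, with utility 28. The gap between the two counts is a small-scale picture of the candidate blow-up that the abstract identifies as the first problem with two-phase methods.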

Keyword :  Closed+ high-utility itemsets, pattern growth, utility mining

Journal/ Proceedings Name :  Computer Science and Information Technology

URL :  http://aircconline.com/ijcsit/V8N5/8516ijcsit08.pdf




