Rough set discretization: equal frequency binning, entropy/MDL and semi naives algorithms of intrusion detection system
Discretization of real value attributes is a vital task in data mining, particularly in the classification problem. Discretization part is also the crucial part resulting the good classification. Empirical results have shown that the quality of classification methods depends on the discretization al...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Digital Information Research Foundation, India
2017
|
Subjects: | |
Online Access: | http://umpir.ump.edu.my/id/eprint/21203/ http://umpir.ump.edu.my/id/eprint/21203/ http://umpir.ump.edu.my/id/eprint/21203/1/Rough%20Set%20Discretization%20Equal%20Frequency%20Binning%2C%20Entropy%20MDL%20and%20Semi%20Naives.pdf |
Summary: | Discretization of real value attributes is a vital task in data mining, particularly in the classification problem. Discretization part is also the crucial part resulting the good classification. Empirical results have shown that the quality of classification methods depends on the discretization algorithm in preprocessing step. Universally, discretization is a process of searching for partition of attribute domains into intervals and unifying the values over each interval. Significant discretization technique suit to the Intrusion Detection System (IDS) data need to determine in IDS framework, since IDS data consist of huge records that need to be examined in system. There are many Rough Set discretization technique that can be used, among of them are Semi Naives and Equal Frequency Binning. |
---|