Go to file
Gregory Martin 910b138668 Initial commt 2017-11-06 11:39:57 +01:00
lib Initial commt 2017-11-06 11:39:57 +01:00
src/main/java Initial commt 2017-11-06 11:39:57 +01:00
.gitignore Initial commt 2017-11-06 11:39:57 +01:00
DMV.iml Initial commt 2017-11-06 11:39:57 +01:00
README.md Initial commt 2017-11-06 11:39:57 +01:00
data.7z Initial commt 2017-11-06 11:39:57 +01:00
pom.xml Initial commt 2017-11-06 11:39:57 +01:00

README.md

DMVHomework - Note to the teacher

Three data mining algorithms have been used for this homework: Apriori, LCM and BIDE+.
Those three algorithms have each a separate class which are located in the package algorithm with their respective name.
Each class has a main method which is able to launch either one instance of the selected algorithm or an experiment on this algorithm.

In order to give inputs to those algorithms, the class main.DatasetConverter is able to take the raw dataset and format it in the good format for Apriori and LCM (.transaction) and for BIDE+ (.sequence).
For BIDE+, since the raw dataset contains named items, the formatted .sequence will have only numbers in it. A file with the same name and an extension .seqinfo gives the association between the name of the item and its ID.

To explore the patterns returned by those algorithms, several methods in the class main.DataExplorer helps to choose meaningful patterns.