Skip to content

juliendelplanque/Arff

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

Arff

An Arff generator written in Pharo.

Arff is a format defined by Weka to be used for data importation. This project implements a representation of weka's arff concepts as objects in Pharo and is able to generate valid arff strings.

See Arff file spec to learn more about this format.

Install

You can install this project using the Catalog Browser or by evaluating the following code in a fresh image:

Metacello new
    baseline: 'Arff';
    repository: 'github://juliendelplanque/Arff/repository';
    load

Use this project as a dependency

Simply add this code snippet to your baseline:

spec baseline: 'Arff' with: [
    spec repository: 'github://juliendelplanque/Arff/repository' ].

Example

The following code:

doc := ArffDocument new.
doc
    relation: 'temperature';
    addDateAttribute: 'time' format: 'yyyy-MM-dd HH:mm:ss';
    addNumericAttribute: 'temperature';
    addNominalAttribute: 'weather' specification: #('sunny' 'cloudy').

doc
    addInstance: #('2015-01-01 12:00:00' 5  'sunny');
    addInstance: #('2015-02-01 18:40:00' 8  'sunny');
    addInstance: #('2015-03-01 05:04:00' 3  'cloudy');
    addInstance: #('2015-04-01 13:01:20' 15 'cloudy');
    addInstance: #('2015-05-01 09:07:00' 12  'sunny');
    addInstance: #('2015-06-01 12:20:00' 6  'cloudy').

doc asString

Generates the arff string:

@relation temperature
@attribute time date "yyyy-MM-dd HH:mm:ss"
@attribute temperature numeric
@attribute weather {sunny,cloudy}
@data
"2015-01-01 12:00:00",5,"sunny"
"2015-02-01 18:40:00",8,"sunny"
"2015-03-01 05:04:00",3,"cloudy"
"2015-04-01 13:01:20",15,"cloudy"
"2015-05-01 09:07:00",12,"sunny"
"2015-06-01 12:20:00",6,"cloudy"

TODO

  • Add messages in ArffDocument to make the DSL simpler.
  • Add possibility to add comment in an ArffDocument.

Pull requests are welcome. :)