A simple dataset converter written in Python. It is currently memory hungry when converting large datasets.
Install the package via pip:
$ pip install convertme
Use the provided CLI:
$ convertme --help
Usage: convertme [OPTIONS]

Options:
  -if, --input-format [csv|fimi|cxt|mat|cex]
                                  [required]
  -of, --output-format [csv|fimi|cxt|mat|cex]
                                  [required]
  -i, --input TEXT                Input file, skip it for stdin.
  -o, --output TEXT               Output file, skip it for stdout.
  --input-delimiter TEXT          (CSV) Delimiter of input. [default: ,]
  --output-delimiter TEXT         (CSV) Delimiter of output. [default: ,]
  --objects-col INTEGER           (CSV) Index of column with object labels,
                                  typically 0, ignored on default.
  --attributes-row INTEGER        (CSV) Index of row with attribute labels,
                                  typically 0, ignored on default.
  --true-values TEXT              (CSV) Values which will be count as True,
                                  comma separated.
  --help                          Show this message and exit.
Convert a simple csv file to fimi format:
$ convertme -i dataset.csv -if=csv -o dataset.fimi -of=fimi
Content of dataset.csv:
1,0,1,0
0,1,0,1
Content of dataset.fimi:
0 2
1 3
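The mapping in the example above can be sketched in a few lines of Python. This is only an illustration of the idea, not convertme's actual implementation: the helper name csv_to_fimi and its parameters are hypothetical, and it assumes each fimi line lists the zero-based column indices of the cells counted as true.

```python
import csv
import io


def csv_to_fimi(csv_text, delimiter=",", true_values=("1",)):
    """Turn binary CSV rows into fimi lines.

    Hypothetical helper for illustration; not part of the convertme package.
    """
    lines = []
    for row in csv.reader(io.StringIO(csv_text), delimiter=delimiter):
        # Keep the zero-based index of every cell counted as true.
        items = [str(i) for i, cell in enumerate(row) if cell.strip() in true_values]
        lines.append(" ".join(items))
    return "\n".join(lines)


print(csv_to_fimi("1,0,1,0\n0,1,0,1\n"))
# 0 2
# 1 3
```

The true_values argument plays a role similar to the --true-values option: any cell whose text is in that collection is treated as a set attribute.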
Supported formats:
- csv
- burmeister (.cxt)
- fimi
- matlab (version<=7.3)
- conexp (.cex)
Clone this repository into a local folder, then:
# create virtualenv (optional)
$ mkvirtualenv convertme -p python3
# activate it if it is not already active (optional)
$ workon convertme
$ pip install -e .
$ python setup.py test