tum-db/ml2sql

ML Compiler

A compiler that translates a declarative language for machine learning into HyperScript, PostgreSQL, or Python.


Dependencies

Project generation and build:

  • (CMake and Make) or only Make

Compiler with C++11 support:

  • g++ 5.5.0

Libraries:

  • ANTLR4
  • csvkit (install with "sudo apt install python3-csvkit")

Building

Using CMake and Make (recommended):

mkdir build && cd build
cmake ..
make

The build stores the executable in the bin folder.


Running the Compiler

Invoke the compiler as follows:

./bin/compiler <inputFile> <outputFile> <targetLanguage> <flags>

The target language can be postgres, hyper, or python. "flags" can be "buildcsv", "usemod", or both.

Hyper and Postgres can only load csv files in a specific format, so the flag "buildcsv" lets the compiler create a new, modified csv file that the database system can process. (The new file has the extension .ml.csv.)

The flag "usemod" lets the compiler generate code that uses the csv file created by the "buildcsv" flag.

Note: use the "buildcsv" flag only when you compile code for the first time or after changing the csv file, since processing the csv file takes some time.
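
The invocation above can also be scripted. The following is a minimal sketch, not part of the project: the helper names `compiler_command` and `compile_file` are hypothetical, and it assumes that each flag is passed as its own trailing argument, which the usage line does not state explicitly.

```python
import subprocess

# Target languages and flags as listed in this README.
TARGETS = {"postgres", "hyper", "python"}
FLAGS = {"buildcsv", "usemod"}


def compiler_command(input_file, output_file, target, flags=(), binary="./bin/compiler"):
    """Build the argument list for one compiler invocation."""
    if target not in TARGETS:
        raise ValueError(f"unknown target language: {target!r}")
    unknown = set(flags) - FLAGS
    if unknown:
        raise ValueError(f"unknown flags: {sorted(unknown)}")
    # Assumption: flags follow the target language as separate arguments.
    return [binary, input_file, output_file, target, *flags]


def compile_file(input_file, output_file, target, flags=()):
    """Run the compiler; raises CalledProcessError on a non-zero exit status."""
    subprocess.run(compiler_command(input_file, output_file, target, flags), check=True)
```

A first run might call `compile_file("model.ml", "model.sql", "postgres", ["buildcsv", "usemod"])` (file names are placeholders), and later runs on an unchanged csv file would pass only `["usemod"]`.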


Installing ANTLR4

To install ANTLR4, run the following commands in a terminal:

wget http://www.antlr.org/download/antlr4-cpp-runtime-4.7.1-source.zip
wget http://www.antlr.org/download/antlr-4.7.1-complete.jar
sudo mv antlr-4.7.1-complete.jar /usr/local/lib/antlr-4.7.1-complete.jar
sudo apt-get install uuid-dev -y
sudo apt install cmake -y
unzip antlr4-cpp-runtime-4.7.1-source.zip -d antlr4-cpp-runtime
cd antlr4-cpp-runtime && mkdir build && mkdir run && cd build
cmake ..
make -j8
sudo make install
LD_LIBRARY_PATH=/usr/local/lib
export LD_LIBRARY_PATH
sudo ldconfig -v

Server

The server accepts JSON data of the following form (default port 5000):

{"lang": "python", "code": "A=[[2]] \n print '%',A"}

Possible destination languages are python, postgres, and hyper (case insensitive). The answer looks like:

{"lang": "PYTHON","result": "import numpy as np


def main():

    A = np.array([ np.array([ 2])])
    print( '{}'.format(  A))

main()
"}

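A client for the server could look like the sketch below. It rests on assumptions the README does not confirm: that requests are sent with HTTP POST to the root path of `localhost:5000`, and that a JSON content type is expected. The function names are hypothetical. Because the sample answer shows line breaks inside the "result" string, the parser uses `strict=False` so that unescaped line breaks would not be rejected.

```python
import json
import urllib.request


def build_payload(lang, code):
    """Encode a translation request in the JSON form the server accepts."""
    return json.dumps({"lang": lang, "code": code}).encode("utf-8")


def parse_answer(raw):
    """Return the generated code from the server's answer.

    strict=False allows control characters such as literal newlines
    inside JSON strings, in case the server sends them unescaped.
    """
    return json.loads(raw, strict=False)["result"]


def translate(lang, code, url="http://localhost:5000/"):
    """Send one request and return the translated code."""
    # Assumptions: POST to the root path with a JSON content type.
    request = urllib.request.Request(
        url,
        data=build_payload(lang, code),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return parse_answer(response.read().decode("utf-8"))
```

With the server running, `translate("python", "A=[[2]] \n print '%',A")` would send the request shown above.
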
Dockerfile

To run ml2sql in a Docker container:

docker build -t ml2sql .
docker run --rm -p 5000:5000 ml2sql

About

A meta language to translate either into Python or PL/pgSQL.
