Skip to content

A ready-to-deploy container for implementing an easy to use REST API to access Language Models.

License

Notifications You must be signed in to change notification settings

hitomi-team/sukima

Repository files navigation

logo

Overview

Sukima is a ready-to-deploy container that implements a REST API for Language Models designed with the specific purpose of easy deployment and scalability.

Curent API Functions

  • models : Fetch a list of ready-to-use Language Models for inference.
  • load : Allocate a Language Model.
  • generate : Use a Language Model to generate tokens.
  • classify : Use a Language Model to classify tokens and retrieve scores.

To view more information for API Usage, see /docs endpoint.

Setup

Setup Guide

Usage Guide

Todo

  • Autoscaling
  • HTTPS Support
  • Rate Limiting
  • Support for other Language Modeling tasks such as Sentiment Analysis and Named Entity Recognition.

License

GPL-2.0

About

A ready-to-deploy container for implementing an easy to use REST API to access Language Models.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Languages