# Generative AI OpenAI API Server

A lightweight server that exposes the Gemini API through an OpenAI-compatible interface.

## Features

This project provides an alternative to Google's `/v1beta/openai/` endpoint by addressing its limitations and extending support for key Gemini features.

### Why not use `/v1beta/openai/` directly?

While Google does provide a partially OpenAI-compatible API, it has significant limitations:

1. **Unsupported endpoints**: Many endpoints, such as `/v1/models`, are not available.
2. **Limited parameters**: Important parameters like `"frequency_penalty"`, `"presence_penalty"`, and `"stop"` are not supported, and including them causes an error instead of being gracefully ignored.
3. **Missing advanced features**: Gemini API capabilities such as context caching and safety settings are unavailable.

This server addresses these issues by acting as middleware between your application and the Gemini API.
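
As a rough illustration of what such a middleware does, the sketch below maps an OpenAI-style chat request onto Gemini's native request shape. All type and helper names here are hypothetical and for illustration only; they are not the project's actual internals.

```typescript
// Hypothetical sketch: translate an OpenAI-style chat request into a
// Gemini-style request. System messages become a systemInstruction,
// "assistant" becomes "model", and sampling parameters are mapped onto
// generationConfig instead of causing an "unsupported parameter" error.

interface OpenAIChatRequest {
  model: string;
  messages: { role: "system" | "user" | "assistant"; content: string }[];
  temperature?: number;
  top_p?: number;
  max_tokens?: number;
  stop?: string | string[];
  frequency_penalty?: number;
  presence_penalty?: number;
}

interface GeminiRequest {
  contents: { role: "user" | "model"; parts: { text: string }[] }[];
  systemInstruction?: { parts: { text: string }[] };
  generationConfig: {
    temperature?: number;
    topP?: number;
    maxOutputTokens?: number;
    stopSequences?: string[];
    frequencyPenalty?: number;
    presencePenalty?: number;
  };
}

function toGeminiRequest(req: OpenAIChatRequest): GeminiRequest {
  const system = req.messages.filter((m) => m.role === "system");
  const rest = req.messages.filter((m) => m.role !== "system");
  return {
    ...(system.length
      ? { systemInstruction: { parts: system.map((m) => ({ text: m.content })) } }
      : {}),
    contents: rest.map((m) => ({
      role: m.role === "assistant" ? ("model" as const) : ("user" as const),
      parts: [{ text: m.content }],
    })),
    generationConfig: {
      temperature: req.temperature,
      topP: req.top_p,
      maxOutputTokens: req.max_tokens,
      // Normalize OpenAI's string-or-array "stop" into Gemini's array form.
      stopSequences:
        req.stop === undefined ? undefined : ([] as string[]).concat(req.stop),
      frequencyPenalty: req.frequency_penalty,
      presencePenalty: req.presence_penalty,
    },
  };
}
```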
## Getting Started

### Prerequisites

- Download the latest release binary.

### Steps to Use

1. Create a configuration file (`genai.config.json`).
2. Run the server. The default port is `4949`.

#### Custom Config

```
main.exe -c my_owner.config.json
```

## Configuration

The server uses a JSON-based configuration file. Below is a basic example:

```json
{
  "api_key": "sk-xxx",
  "server": {
    "port": 4949
  }
}
```

### Full Configuration Options

Here is the complete list of configurable parameters:

```ts
type Params = {
  api_key: string;
  server?: {
    port?: number;
  };
  no_docs?: boolean;
  retry?: {
    enabled?: boolean;
    retries?: number;
    factor?: number;
    minTimeout?: number;
    maxTimeout?: number;
  };
  debug?: {
    stream?: {
      log?: boolean;
    };
  };
};
```
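
The `retry` fields follow the naming conventions of the npm `retry` package, where the delay before attempt *n* grows geometrically as `min(minTimeout * factor^n, maxTimeout)`. Whether this project uses exactly that formula, and what its defaults are, is an assumption; the sketch below only illustrates how such parameters typically interact.

```typescript
// Illustrative sketch (not this project's actual code): compute the
// sequence of backoff delays implied by retry-package-style options.
// The default values here are assumptions for demonstration.

type RetryOptions = {
  retries?: number;
  factor?: number;
  minTimeout?: number; // milliseconds
  maxTimeout?: number; // milliseconds
};

function backoffDelays(opts: RetryOptions = {}): number[] {
  const { retries = 3, factor = 2, minTimeout = 1000, maxTimeout = 60000 } = opts;
  const delays: number[] = [];
  for (let attempt = 0; attempt < retries; attempt++) {
    // Geometric growth, capped at maxTimeout.
    delays.push(Math.min(minTimeout * Math.pow(factor, attempt), maxTimeout));
  }
  return delays;
}
```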

## Supported Endpoints

The server currently supports the following endpoints:

- **`/v1/models`**: Retrieve the list of available models.
- **`/v1/embeddings`**: Generate vector embeddings for input text.
- **`/v1/chat/completions`**: Chat-based text completions.

> **Note**: `/v1/completions` is not supported because Gemini models do not support plain completion, and Google's PaLM model (which does) is likely to be deprecated.
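
As an example of the reshaping involved, Gemini's native model listing returns names like `models/gemini-pro`, while OpenAI's `/v1/models` returns a `{ object: "list", data: [...] }` envelope. The mapping below is an illustrative sketch, not the server's actual implementation.

```typescript
// Hypothetical sketch: convert Gemini model entries into an
// OpenAI-style /v1/models response. Field choices like owned_by are
// illustrative assumptions.

interface GeminiModel {
  name: string; // e.g. "models/gemini-pro"
}

interface OpenAIModel {
  id: string;
  object: "model";
  owned_by: string;
}

function toOpenAIModelList(models: GeminiModel[]) {
  return {
    object: "list" as const,
    data: models.map(
      (m): OpenAIModel => ({
        // Strip the "models/" prefix so clients see plain model ids.
        id: m.name.replace(/^models\//, ""),
        object: "model",
        owned_by: "google",
      })
    ),
  };
}
```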

## Building the Project

```
pnpm run build:ci
```

## License

This project is licensed under the **MIT License**.