
🗺️ Keras Development Roadmap #19519

Open
fchollet opened this issue Apr 15, 2024 · 4 comments

Comments

@fchollet
Member

fchollet commented Apr 15, 2024

Here's an overview of the features we intend to work on in the near future, across Core Keras, KerasNLP, and KerasCV.

Core Keras

Saving & export

  • [Open for Contributions] Add utility for saved Keras model file inspection, diff, and patching.
  • Implement saving support for sharded models (sharded weights files).
  • Improve model export story:
    • [Open for Contributions] Implement export to ONNX (from any backend)
    • [Open for Contributions] Implement export to TensorRT (from any backend)
    • [Open for Contributions] Write a serving/export guide explaining the different serving options (SavedModel, ONNX, TensorRT, TorchServe, etc.)
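
The inspection/diff/patch utility proposed above does not exist yet, but the `.keras` file format is a zip archive (containing `config.json`, `metadata.json`, and the weights file), so a contribution could start from stdlib tooling. A minimal sketch, with `inspect_keras_file` as a hypothetical name:

```python
import json
import zipfile

def inspect_keras_file(path):
    """List the contents of a .keras archive and pull out basic metadata.

    Hypothetical helper sketching the roadmap's inspection utility; a real
    implementation would also diff configs and patch weights in place.
    """
    report = {"entries": [], "keras_version": None}
    with zipfile.ZipFile(path) as zf:
        for info in zf.infolist():
            report["entries"].append((info.filename, info.file_size))
        if "metadata.json" in zf.namelist():
            metadata = json.loads(zf.read("metadata.json"))
            report["keras_version"] = metadata.get("keras_version")
    return report
```

From here, a diff would compare two reports entry by entry, and a patch would rewrite individual archive members without touching the rest.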

Distribution

  • Add PyTorch SPMD support in keras.distribution.
  • Add TensorFlow SPMD (DTensor) support in keras.distribution.
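
For context on what SPMD support means here: under SPMD data parallelism, each device runs the same program on a different shard of the global batch (today `keras.distribution` provides this on the JAX backend via `DeviceMesh`/`LayoutMap`; the bullets above extend it to PyTorch and TensorFlow). A backend-free sketch of just the sharding bookkeeping, with `shard_batch` as a hypothetical helper:

```python
def shard_batch(batch, num_devices):
    """Split a global batch into equal per-device shards, as SPMD data
    parallelism does. Illustrative only; the real layout logic lives in
    keras.distribution, not in user code.
    """
    if len(batch) % num_devices != 0:
        raise ValueError("global batch size must divide evenly across devices")
    per_device = len(batch) // num_devices
    return [batch[i * per_device:(i + 1) * per_device]
            for i in range(num_devices)]
```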

Performance

  • Enable PyTorch compilation on any Keras model.
  • Also make it possible to provide torch compilation arguments to Keras.
  • [Open for Contributions] Add a profiler Callback. With any backend, this callback should be able to give you a report on the computation/time breakdown of your layers and your train step.
  • [Open for Contributions] Add ops.flash_attention ops for PyTorch and JAX (implemented via Pallas).
  • ✅ Add Float8 training.
  • ✅ Add LoRA support to key layer types (Dense, EinsumDense, Conv layers)
  • ✅ Enable QLoRA
  • ✅ Enable int8 inference for key layer types (Dense, EinsumDense)
  • ✅ Enable float8 training for key layer types (Dense, EinsumDense)
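
The profiler Callback proposed above could be prototyped with plain wall-clock timing. The sketch below uses the same hook names as `keras.callbacks.Callback` (`on_train_batch_begin`/`on_train_batch_end`) but is a standalone class so it stays backend-independent; the class name and `report` method are hypothetical:

```python
import time
from collections import defaultdict

class TimingProfiler:
    """Sketch of the proposed profiler callback: records wall-clock time
    per training batch. A real implementation would also break the time
    down per layer and per train-step phase.
    """
    def __init__(self):
        self.batch_times = defaultdict(list)
        self._start = None

    def on_train_batch_begin(self, batch, logs=None):
        self._start = time.perf_counter()

    def on_train_batch_end(self, batch, logs=None):
        self.batch_times[batch].append(time.perf_counter() - self._start)

    def report(self):
        total = sum(t for ts in self.batch_times.values() for t in ts)
        return {"num_batches": len(self.batch_times), "total_seconds": total}
```

Subclassing `keras.callbacks.Callback` and passing an instance to `model.fit(callbacks=[...])` would wire this into a real training loop.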

Modeling

  • ✅ Bring back deeply nested Functional inputs/outputs. Currently, the inputs and outputs arguments of Model must be flat structures (e.g. flat dicts or lists). However, it may be useful to support e.g. dicts of lists or dicts of dicts.
  • ✅ Add support for optional inputs in Functional models.
  • [Open for Contributions] Increase coverage of NumPy API in keras.ops.
    • Any function from the NumPy API is in scope for inclusion. If you notice a missing op that you need, please open a PR!
    • To prioritize:
      • ops.argpartition
      • ops.vectorize
      • ops.select
      • ops.eigh
  • Add tensor.at operator.
  • Add sparse support to PyTorch backend.
  • Increase coverage of sparse support in keras.ops:
    • Reduction ops
    • Boolean ops
  • [Open for Contributions] Add multi-backend scan operation in the style of JAX.
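
For the scan item above, the contract to match is that of `jax.lax.scan`: a step function maps `(carry, x)` to `(new_carry, y)`, and scan threads the carry through a sequence while stacking the `y` outputs. A pure-Python reference of the semantics (a real multi-backend op would dispatch to each backend's native loop primitive):

```python
def scan(f, init, xs):
    """Reference semantics for a jax.lax.scan-style op: f maps
    (carry, x) -> (new_carry, y); returns the final carry plus all ys.
    """
    carry = init
    ys = []
    for x in xs:
        carry, y = f(carry, x)
        ys.append(y)
    return carry, ys

# Example: cumulative sum expressed as a scan.
final, cumsum = scan(lambda c, x: (c + x, c + x), 0, [1, 2, 3, 4])
```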

Ecosystem

  • Add integration with Hugging Face to enable one-line upload/download of .keras models.
  • [Open for Contributions] Finish implementing the MLX backend, in collaboration with Apple (see mlx branch).
  • Add OpenVINO inference-only backend, in collaboration with Intel.

Code health

  • ✅ Check in the output of pip_build.py to make the Keras package runnable from its sources without any postprocessing.

Keras "starter packs"

  • [Open for Contributions] Create a Recommender System "starter pack": a standalone repository containing concise, readable, customizable code to implement a recommender system with Keras 3 (supporting all backends), including model code, data loading code, training code, and model export for serving.
  • [Open for Contributions] Large-scale LLM training starter pack using keras.distribution API and JAX backend.

Keras Models

  • Unify KerasCV and KerasNLP into a single Keras Models package.

KerasCV

  • Add an integration with Hugging Face to enable one-line upload of fine-tuned models to Hugging Face Hub.
  • Make StableDiffusion model conform to from_preset API.
  • Add support for channels_first format in all models. Necessary for PyTorch performance.
  • Also add channels_first testing on CI.
  • Add standard image classifiers to KerasCV
    • All Keras Applications models should be brought to KerasCV.
    • Add more pretrained weights to standard image classifiers, in particular OpenImages.
      • Right now we only have ImageNet weights.
  • Drop Keras 2 support.
  • [Open for Contributions] Scrub TF references from docstrings.
  • Scrub Waymo-specific C++ code.
  • [Open for Contributions] Add new models:
    • OCR
    • CoCa
    • Faster RCNN
    • YOLOv8 Instance segmentation
    • VVIT
    • Grounding Dino
    • Visual Q&A task model with CLIP
    • StableDiffusion v3
    • Mask RCNN
    • YOLOv9
    • Depth anything
    • layoutlmv3
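
On the channels_first item above: channels_last stores image batches as (N, H, W, C) while channels_first uses (N, C, H, W), the layout PyTorch kernels are fastest on. In practice the conversion is one transpose, e.g. `keras.ops.transpose(x, (0, 3, 1, 2))`; the nested-list sketch below only spells out that permutation, with a hypothetical function name:

```python
def nhwc_to_nchw(batch):
    """Convert a nested-list image batch from channels_last (N, H, W, C)
    to channels_first (N, C, H, W). Illustrative only; real code would
    use a single tensor transpose.
    """
    return [
        [[[img[h][w][c] for w in range(len(img[0]))]
          for h in range(len(img))]
         for c in range(len(img[0][0]))]
        for img in batch
    ]
```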

KerasNLP

  • Add an integration with Hugging Face to enable one-line upload of fine-tuned models to Hugging Face Hub.
  • Remove dependency on tf-text for KerasNLP. Reimplement in Python.
  • Add support for dynamic sequence length inference for PyTorch LLMs in KerasNLP, to bridge the performance gap with Hugging Face.
  • Add support for QLoRA to select KerasNLP models.
  • Drop Keras 2 support.
  • Add FlashAttention support for select models.
  • [Open for Contributions] Add new models:
    • Llama2, 3
    • Falcon
    • Mixtral
    • Bloom
    • Flan
    • MPT
    • Vicuna
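
The LoRA/QLoRA items above (here and under Performance) all build on the same low-rank update: a frozen weight matrix W is adapted as W + αAB, where only the small factors A and B are trained (QLoRA additionally keeps W quantized). A tiny pure-Python sketch of the math, not of Keras's actual layer API:

```python
def matmul(a, b):
    """Dense matmul on nested lists, for illustration only."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lora_forward(x, w, a, b, alpha=1.0):
    """LoRA forward pass: y = x @ (W + alpha * A @ B). Only A and B
    (the low-rank factors) are trained; W stays frozen. In Keras the
    real hook is layer.enable_lora(rank) on Dense/EinsumDense.
    """
    delta = matmul(a, b)
    w_eff = [[w[i][j] + alpha * delta[i][j] for j in range(len(w[0]))]
             for i in range(len(w))]
    return matmul(x, w_eff)
```

With rank r much smaller than the matrix dimensions, A (d×r) and B (r×d) add only a sliver of trainable parameters, which is what makes fine-tuning large LLMs tractable.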
@fchollet fchollet pinned this issue Apr 15, 2024
@fchollet fchollet changed the title Keras Development Roadmap 🗺️ Keras Development Roadmap Apr 15, 2024
@pure-rgb

Unify KerasCV and KerasNLP into a single Keras Models package.

  1. Does this mean KerasCV and KerasNLP will be merged into one single package, like the TensorFlow Model Garden?

  2. KerasCV development is terribly slow and lacks resources. There has been no release since February (0.8.2), and many models don't have weights, e.g. ViT, MiT. The object detection models that were added (YOLOv8, RetinaNet, Faster R-CNN) are all broken and have no evaluation scripts to verify their validity. The only generative model is Stable Diffusion, while many impactful models are available now.

  3. Like Hugging Face, why aren't engineers or researchers hired and put behind this package? Check this repo: the owner can add any new model written in Keras, so why not the official Keras team? Is it because the API design is too rich and complicated for contributors? Not to mention, many reviews are left pending, and contributors leave in the end.

@IMvision12
Contributor

I concur with @pure-rgb's observation that the API design has become overly elaborate and intricate for contributors. Previously, it was straightforward and comprehensible, but with the introduction of the Backbone API, it has become more convoluted.

@pure-rgb

 with the introduction of the Backbone API, it has become more convoluted.

The person who introduced that API design has already left Google. :p

@kernel-loophole

@fchollet I've been going through the features you're planning to implement, and I'm particularly interested in contributing to KerasNLP. Specifically, I'm eager to get involved in the development of dynamic sequence length inference for PyTorch LLMs.
