{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":705695450,"defaultBranch":"main","name":"distilabel","ownerLogin":"argilla-io","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-10-16T14:12:33.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/18415507?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1716214307.0","currentOid":""},"activityList":{"items":[{"before":"690013ada15950322eead35f68dbbbe2d149ee62","after":"fac5fdcf3f9b85b126ce94bb2e36c8ae0ba661e0","ref":"refs/heads/develop","pushedAt":"2024-05-20T14:23:07.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Prepare branch for v1.2.0","shortMessageHtmlLink":"Prepare branch for v1.2.0"}},{"before":"8a5742738b2ef2d84a9ed220b1fe299953f51eb0","after":"9030f33b19393283fc03ca7f770424567dc4a009","ref":"refs/heads/gh-pages","pushedAt":"2024-05-20T14:13:04.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed 690013a to dev with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed 690013a to dev with MkDocs 1.6.0 and mike 2.1.1"}},{"before":null,"after":"690013ada15950322eead35f68dbbbe2d149ee62","ref":"refs/heads/develop","pushedAt":"2024-05-20T14:11:47.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix routing batch function deadlocks and unordered batches (#649)\n\n* Add checking step `input_batch_size` multiple\r\n\r\n* Fix unordered batches when using `routing_batch_function`\r\n\r\n* Fix `can_generate` condition\r\n\r\n* Remove metadata and style\r\n\r\n* Fix getting data for batch when irregular batch sizes\r\n\r\n* Fix steps receiving routed batches getting stuck\r\n\r\n* Fix `_last_batch_convergence_step` method\r\n\r\n* Fix stop not checking for `None`\r\n\r\n* Fix issues related to the queues\r\n\r\n* Remove unused variable\r\n\r\n* Add integration tests timeout\r\n\r\n* Fix deadlock caused becase next expected batch in convergence step\r\n\r\n* Update unit tests\r\n\r\n* Add timeout to tests\r\n\r\n* Simplify condition\r\n\r\n* Fix unit test\r\n\r\n* Update timeouts","shortMessageHtmlLink":"Fix routing batch function deadlocks and unordered batches (#649)"}},{"before":"5015919e74c22ad9704240b8a2bbbb3f71592f97","after":"8a5742738b2ef2d84a9ed220b1fe299953f51eb0","ref":"refs/heads/gh-pages","pushedAt":"2024-05-20T14:03:23.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed 690013a to 1.1.0 with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed 690013a to 1.1.0 with MkDocs 1.6.0 and mike 2.1.1"}},{"before":"9f38b4931398f626e07cbe2a83ef393de661f428","after":"690013ada15950322eead35f68dbbbe2d149ee62","ref":"refs/heads/main","pushedAt":"2024-05-20T13:54:36.000Z","pushType":"push","commitsCount":44,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix routing batch function deadlocks and unordered batches (#649)\n\n* Add checking step `input_batch_size` multiple\r\n\r\n* Fix unordered batches when using `routing_batch_function`\r\n\r\n* Fix `can_generate` condition\r\n\r\n* Remove metadata and style\r\n\r\n* Fix getting data for batch when irregular batch sizes\r\n\r\n* Fix steps receiving routed batches getting stuck\r\n\r\n* Fix `_last_batch_convergence_step` method\r\n\r\n* Fix stop not checking for `None`\r\n\r\n* Fix issues related to the queues\r\n\r\n* Remove unused variable\r\n\r\n* Add integration tests timeout\r\n\r\n* Fix deadlock caused becase next expected batch in convergence step\r\n\r\n* Update unit tests\r\n\r\n* Add timeout to tests\r\n\r\n* Simplify condition\r\n\r\n* Fix unit test\r\n\r\n* Update timeouts","shortMessageHtmlLink":"Fix routing batch function deadlocks and unordered batches (#649)"}},{"before":"e349cd427f1cc246dbfcc62c313cfdb42758ada7","after":"5015919e74c22ad9704240b8a2bbbb3f71592f97","ref":"refs/heads/gh-pages","pushedAt":"2024-05-20T13:53:54.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed 9f38b49 to dev with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed 9f38b49 to dev with MkDocs 1.6.0 and mike 2.1.1"}},{"before":"d0db7fbd62255d0aa38c647a9ada9539ac8d831c","after":"9f38b4931398f626e07cbe2a83ef393de661f428","ref":"refs/heads/main","pushedAt":"2024-05-20T13:52:39.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Add `stop` and `stop_sequences` in `LLM.generate` subclasses (#585)\n\n* Add `stop_sequences` arg to `InferenceEndpointsLLM.generate`\r\n\r\n* Add `stop` arg to `OpenAILLM.generate`\r\n\r\n* Set `stop_token_ids` from `eos_token_id` if not set\r\n\r\n* Bump version to `1.0.3`\r\n\r\n* Skip `tokenizer.eos_token_id` defaults for `stop_sequences`\r\n\r\nShould be handled in the LLM / TGI / etc. side, so as long as we can set the values we're good, no need to set defaults too, as those can be misleading and wrong in some cases\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n---------\r\n\r\nCo-authored-by: Gabriel Martin ","shortMessageHtmlLink":"Add stop and stop_sequences in LLM.generate subclasses (#585)"}},{"before":"690013ada15950322eead35f68dbbbe2d149ee62","after":null,"ref":"refs/heads/develop","pushedAt":"2024-05-20T13:46:05.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"}},{"before":"9f38b4931398f626e07cbe2a83ef393de661f428","after":"d0db7fbd62255d0aa38c647a9ada9539ac8d831c","ref":"refs/heads/main","pushedAt":"2024-05-20T13:46:04.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"`distilabel` v1.1 (#559)\n\n* Allow nested connect calls and overload rshift method to connect steps (#490)\r\n\r\n* Allow nested connect calls and overload rshift method to connect steps\r\n\r\n* Update src/distilabel/steps/base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/pipeline/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/pipeline/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/pipeline/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/pipeline/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/pipeline/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Add comment to simplify reading the tests\r\n\r\n* Add reference on the Pipeline of alternative ways of connecting the steps\r\n\r\n---------\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Fix `llm_blender`installation from `argilla-io` fork (#557)\r\n\r\n* Warn user about unknown runtime parameters (#555)\r\n\r\n* Add warning of unknown runtime parameters\r\n\r\n* Run check only when in a pipeline, to allow unit tests for runtime parameters to pass\r\n\r\n* Add missing `model_name`, update docstrings, and add `*.jinja2` templates to `Task` subclasses (#560)\r\n\r\n* Remove not required `else` statement in `UltraFeedback`\r\n\r\n* Add missing `model_name` and clean formatting in `SelfInstruct`\r\n\r\n* Move `QualityScorer` template to `quality-scorer.jinja2`\r\n\r\n* Move `ComplexityScorer` template to `complexity-scorer.jinja2`\r\n\r\n* Add `model_name` to `GenerateEmbeddings`\r\n\r\n* Fix docstrings in `InstructionBacktranslation`\r\n\r\n* Remove `input_batch_size` in `PairRM`\r\n\r\n* Add `model_name` in `PairRM` and update docstrings\r\n\r\n* Add `model_name` and missing docstrings in `ComplexityScorer`\r\n\r\n* Fix docstrings and add `model_name` in `QualityScorer`\r\n\r\n* Fix `TestPairRM` with `model_name` and `input_batch_size`\r\n\r\n* Split `ChatGeneration` from `TextGeneration` (#558)\r\n\r\n* Add `ChatGeneration` and rename `text_generation->generation`\r\n\r\n* Add `ChatGeneration` tests and catch `DeprecationWarning`\r\n\r\n* Revert `generation.py` rename and add `system_prompt` to `{Chat,Text}Generation`\r\n\r\n* Add missing tests for `{Chat,Text}Generation`\r\n\r\n* Add missing `InstructionBacktranslation` task in `preference_tasks.md`\r\n\r\n* Fix weird characters around `#`\r\n\r\n* Update `docs/` related to `{Chat,Text}Generation`\r\n\r\n* Add `use_system_prompt` in `TextGeneration`\r\n\r\n* Update `text_generation.md`\r\n\r\n* Set `extra=\"forbid\"` in `{_Step,LLM}.model_config` (#577)\r\n\r\n* Set `extra=\"forbid\"` in `_Step.model_config`\r\n\r\n* Set `extra=\"forbid\"` in `LLM.model_config`\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n* Fix `TestVertexAILLM` since `api_key` does not exist\r\n\r\nMost likely due to a copy-over from an existing test\r\n\r\n* Pop `runtime_paremeters_info` in `from_dict`\r\n\r\nAs `runtime_parameters_info` is only used in the CLI, and not required to instantiate a `_Step` subclass\r\n\r\n---------\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n* Infer `Step.name` if not provided (#575)\r\n\r\n* Log with a warning instead of raising an error when a step name is not found when passing runtime parameters via pipeline.run\r\n\r\n* Infer the name of the step if the user doesn't set one\r\n\r\n* Make the function to infer the name private and add docstrings\r\n\r\n* Add tests for the inferred name of steps and move the call to be done after the pipeline is set via the pipeline manager\r\n\r\n* Update src/distilabel/steps/base.py\r\n\r\nCo-authored-by: Gabriel Martín Blázquez \r\n\r\n* Update warning message to make it more explicit\r\n\r\n* Fix name inference with more than 10 steps and possible repeated names\r\n\r\n---------\r\n\r\nCo-authored-by: Gabriel Martín Blázquez \r\n\r\n* Set `spawn` as multiprocessing start method if Windows (#578)\r\n\r\n* Dump logs within a file in `.cache/distilabel/pipelines` dir (#568)\r\n\r\n* Write logs to file in the cache folder\r\n\r\n* Push log file to the dataset in hugging face hub\r\n\r\n* Ensure the cache folder exists when the pipeline log file is created on setup_logging\r\n\r\n* Fix docstring\r\n\r\n* Update log handlers to write the timestamp and simplify the setup logging via the queue listener\r\n\r\n* Fix empty batches causing missaligment when branching (#590)\r\n\r\n* Fix empty batches causing missaligment when branching\r\n\r\n* Add `_BatchManager.can_generate` unit test\r\n\r\n* Update tests/integration/test_branching_missaligmnent.py\r\n\r\nCo-authored-by: Agus \r\n\r\n---------\r\n\r\nCo-authored-by: Agus \r\n\r\n* Add `GroqLLM` (#583)\r\n\r\n* Add `GroqLLM`\r\n\r\nCo-authored-by: Krishna Tripathi \r\n\r\n* Improve dependency installation in `test.yml`\r\n\r\n* Add `GroqLLM` to documentation\r\n\r\n* Add `TestGroqLLM`\r\n\r\n* Remove extra line break in `CohereLLM` docstring\r\n\r\n* Add `GroqLLM` docstring\r\n\r\n---------\r\n\r\nCo-authored-by: Krishna Tripathi \r\n\r\n* Add `Format{Chat,Text}Generation{DPO,SFT}` (#584)\r\n\r\n* Add `Format{Chat,Text}Generation{DPO,SFT}` (WIP)\r\n\r\n* Add `TestFormat{Chat,Text}GenerationSFT`\r\n\r\n* Add `TestFormat{Chat,Text}GenerationDPO`\r\n\r\n* Fix `title` in `RatingQuestion` of `PreferenceToArgilla` (#597)\r\n\r\n* Remove `based on the annotation guidelines` from `PreferenceToArgilla`\r\n\r\n* Add missing `# type: ignore`\r\n\r\n* Fix `test_preference`\r\n\r\n* Set `streaming=False` and add `num_examples` to `LoadHubDataset` (#565)\r\n\r\n* Update LoadHubDataset to allow for more flexibility related to streaming and column fetching via API\r\n\r\n* Add tests for new load_hub_dataset cases\r\n\r\n* Fix docstring comment\r\n\r\n* Allow passing num_examples as a runtime parameter to simplify loading small number of examples from datasets\r\n\r\n* Update src/distilabel/steps/generators/huggingface.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n---------\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Make `pipeline` argument of `Step` optional (#566)\r\n\r\n* Make pipeline argument of step optional\r\n\r\n* Fix failing tests\r\n\r\n* Fix logger instantiation from model_post_init of step\r\n\r\n* Update tests/unit/pipeline/test_dag.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/tasks/evol_instruct/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/tasks/evol_instruct/test_generator.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/tasks/evol_quality/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/tasks/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update tests/unit/steps/test_base.py\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Fix comment from review\r\n\r\n* Format tests\r\n\r\n---------\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Extend `LLM` kwargs to align with counterparts (#594)\r\n\r\n* Extend `vLLM` supported kwargs\r\n\r\n* Extend `LlamaCppLLM` supported kwargs\r\n\r\n* Fix `super().load()` placement in `LlamaCppLLM`\r\n\r\nRevert from a previous commit where this was changed unintentionally, since no notes were written down about it, but now a note and context has been included to prevent this from happening to someone else\r\n\r\n* Add missing kwargs in `InferenceEndpointsLLM.agenerate`\r\n\r\n* Rename `llamacpp_llm` to `llm`\r\n\r\n* Add `Genstruct` task (#600)\r\n\r\n* Add `Genstruct` and `genstruct.jinja2`\r\n\r\n* Add `test_genstruct.py`\r\n\r\n* Fix `Genstruct` regex\r\n\r\n* Fix `num_examples` to be optional in `LoadHubDataset` (#603)\r\n\r\n* Fix `list_files_in_dir` returning unsorted files (#609)\r\n\r\nCo-authored-by: plaguss \r\n\r\n* Add `PrometheusEval` task (#610)\r\n\r\n* Add `PrometheusAbsEval`\r\n\r\n* Add `PrometheusRelEval`\r\n\r\n* Add `Prometheus{Abs,Rel}Eval` docstrings\r\n\r\n* Add imports in `distilabel.steps.tasks`\r\n\r\n* Fix import order in `UltraFeedback`\r\n\r\n* Fix `_template` path in `Prometheus{Abs,Rel}Eval`\r\n\r\n* Add `TestPrometheus{Abs,Rel}Eval`\r\n\r\n* Add missing `model_name` in `Genstruct` and `Prometheus{Abs,Rel}Eval`\r\n\r\n* Combine `Prometheus{Abs,Rel}Eval` into `PrometheusEval`\r\n\r\n* Update `test_imports` and `test_prometheus_eval`\r\n\r\n* Update `ValueError` on missing inputs message (#617)\r\n\r\n* Run `codespell` to fix typos\r\n\r\n* Update `ValueError` exception on missing inputs\r\n\r\n* Add `routing_batch_function` (#595)\r\n\r\n* Update `connect` to accept `*args` and `routing_batch_function`\r\n\r\n* Use `routing_batch_function`\r\n\r\n* Add `convergence_step` attribute\r\n\r\n* Update `_BatchManager` buffers to store `_Batch`es\r\n\r\n* Fix `step_empty_buffers` method\r\n\r\n* Fix steps not processing all the rows because no batch copy\r\n\r\n* Add `stop` and `stop_sequences` in `LLM.generate` subclasses (#585)\r\n\r\n* Add `stop_sequences` arg to `InferenceEndpointsLLM.generate`\r\n\r\n* Add `stop` arg to `OpenAILLM.generate`\r\n\r\n* Set `stop_token_ids` from `eos_token_id` if not set\r\n\r\n* Bump version to `1.0.3`\r\n\r\n* Skip `tokenizer.eos_token_id` defaults for `stop_sequences`\r\n\r\nShould be handled in the LLM / TGI / etc. side, so as long as we can set the values we're good, no need to set defaults too, as those can be misleading and wrong in some cases\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n---------\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n* Dump logs within a file in `.cache/distilabel/pipelines` dir (#568)\r\n\r\n* Write logs to file in the cache folder\r\n\r\n* Push log file to the dataset in hugging face hub\r\n\r\n* Ensure the cache folder exists when the pipeline log file is created on setup_logging\r\n\r\n* Fix docstring\r\n\r\n* Update log handlers to write the timestamp and simplify the setup logging via the queue listener\r\n\r\n* Fix empty batches causing missaligment when branching (#590)\r\n\r\n* Fix empty batches causing missaligment when branching\r\n\r\n* Add `_BatchManager.can_generate` unit test\r\n\r\n* Update tests/integration/test_branching_missaligmnent.py\r\n\r\nCo-authored-by: Agus \r\n\r\n---------\r\n\r\nCo-authored-by: Agus \r\n\r\n* Add checking if can create batch for convergence step\r\n\r\n* Remove unit test\r\n\r\n* Add `GroqLLM` (#583)\r\n\r\n* Add `GroqLLM`\r\n\r\nCo-authored-by: Krishna Tripathi \r\n\r\n* Improve dependency installation in `test.yml`\r\n\r\n* Add `GroqLLM` to documentation\r\n\r\n* Add `TestGroqLLM`\r\n\r\n* Remove extra line break in `CohereLLM` docstring\r\n\r\n* Add `GroqLLM` docstring\r\n\r\n---------\r\n\r\nCo-authored-by: Krishna Tripathi \r\n\r\n* Add `_get_data_for_convergence_step` method\r\n\r\n* Add sending `LAST_BATCH_SENT_FLAG` to steps\r\n\r\n* It's working\r\n\r\n* Add `routing_batch_function` decorator\r\n\r\n* Fix circular import\r\n\r\n* Fix pyright `>>` errors\r\n\r\n* Having second thoughts with this thing\r\n\r\n* Confirmed\r\n\r\n* Add `routing_batch_function` related dag validation\r\n\r\n* Add test for routing batch function\r\n\r\n* Add `time.sleep`\r\n\r\n* Fix `_BatchManager` unit tests\r\n\r\n* Remove unit test\r\n\r\n* Fix unit test\r\n\r\n* Add convergence step batch manager unit tests\r\n\r\n* Add example\r\n\r\n* Add missing attributes to docstring\r\n\r\n* Add `sample_n_steps` routing batch function\r\n\r\n* Add `routing_batch_function` docs\r\n\r\n* Add pipeline typing api\r\n\r\n* Add DAG nodes keys constants\r\n\r\n* Fix `test` workflow\r\n\r\n* Fix typo\r\n\r\nCo-authored-by: Agus \r\n\r\n---------\r\n\r\nCo-authored-by: Alvaro Bartolome \r\nCo-authored-by: Gabriel Martin \r\nCo-authored-by: Agus \r\nCo-authored-by: Krishna Tripathi \r\n\r\n* Fix `pipeline.log` inconsistency & include LLM info in signature (#598)\r\n\r\n* Include LLM info in signature\r\n\r\n* Add draft of fix\r\n\r\n* Fix problem with pipeline.log order of cache folder creation\r\n\r\n* Fix creation of parent dir if doesn't already exist\r\n\r\n* Add custom `rubrics` attribute to `PrometheusEval` (#621)\r\n\r\n* Add `rubrics` attribute in `PrometheusEval`\r\n\r\n* Fix `typing.Self` import to `typing_extensions`\r\n\r\n* Fix `TestPrometheusEval` to use `_DEFAULT_RUBRICS`\r\n\r\n* Update `PrometheusEval` docstrings\r\n\r\n* Add tests for `rubrics` in `PrometheusEval`\r\n\r\n* Remove duplicated `Dict[str,str]` validation\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n---------\r\n\r\nCo-authored-by: Gabriel Martin \r\n\r\n* Update `UltraFeedback` paper replication to use `routing_batch_function` (#620)\r\n\r\n* Update `UltraFeedback` paper replication to use `routing_batch_function`\r\n\r\n* Add note about batch sizes\r\n\r\n* Update docs/sections/papers/ultrafeedback.md\r\n\r\n* Add `distilabel_metadata` column to the datasets to include general data (#586)\r\n\r\n* Add the option of passing the multiprocessing context via env var (#604)\r\n\r\n* Add name of the pipeline to group the hashed folders (#626)\r\n\r\n* Add `routing_batch_function` serialization (#628)\r\n\r\n* Add `RoutingBatchFunction` serialization\r\n\r\n* Add `RoutingBatchFunction` serialization\r\n\r\n* Update info command\r\n\r\n* Fix routing batch function tests\r\n\r\n* Fix `@routing_batch_function` detecting factory function bug\r\n\r\n* Don't serialize `model_path` in `LlamaCpp`\r\n\r\n* Fix problem of sorting files whose name is numbered (#622)\r\n\r\n* Add `dry_run` method to the pipelines to run with a single example. (#635)\r\n\r\n* [FEATURE] Add structured outputs using `outlines` (#601)\r\n\r\nCo-authored-by: Alvaro Bartolome \r\nCo-authored-by: Gabriel Martín Blázquez \r\n\r\n* Force pipeline stop after 2 SIGINT signals caught (#630)\r\n\r\n* Refactor and update `docs` (#634)\r\n\r\n* Bump version to 1.1.0\r\n\r\n* Fix `project.license` in `pyproject.toml`\r\n\r\n* Fix `docs/scripts/gen_ref_pages.py`\r\n\r\n* Update `description` in `pyproject.toml` and `mkdocs.yml`\r\n\r\n* Update `index.md`\r\n\r\n* Remove unused `docs/overview.md`\r\n\r\n* Update `mkdocs.yml` and `docs/*md` (WIP)\r\n\r\n* Fix typo in `ComplexityScorer` docstring\r\n\r\n* Update `mkdocs.yml` and `docs/*.md` (WIP)\r\n\r\n* Fix indentation after `!!! NOTE`\r\n\r\n* Update `mkdocs.yml` and `docs/*md` (WIP)\r\n\r\n* Update `mkdocs.yml` and `docs/*md` (WIP)\r\n\r\n* Update `mkdocs.yml` and `docs/*md` (WIP)\r\n\r\n* Include example of dry_run method\r\n\r\n* Fix link to pipeline\r\n\r\n* Change section names with class methods\r\n\r\n* Add FAQ section in `docs` (WIP)\r\n\r\n* Update `docs/sections/components/step/index.md`\r\n\r\n* Update `docs/sections/components/step/index.md`\r\n\r\n* Fix typos via `codespell`\r\n\r\n* Apply suggestions from code review\r\n\r\n* Update docs structured outputs (#636)\r\n\r\n* Update `step/index.md` and `faq.md`\r\n\r\n* Update `step/*.md` and `task/index.md`\r\n\r\n* Update more docs (#637)\r\n\r\n* Update `llm/index.md`\r\n\r\nNot sure if the structured generation fits nicely there, as that's most likely a tutorial or a subpage or something else (?)\r\n\r\n* Rename `docs/components/pipelines` to `docs/components/pipeline`\r\n\r\n* Update docs/sections/components/llm/index.md\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Fix `@step` section name\r\n\r\n* Update `generator_task.md`\r\n\r\n* Move structured outputs to it's own section and include references (#638)\r\n\r\n* Update FAQ layout (#639)\r\n\r\n* Apply suggestions from code review\r\n\r\nCo-authored-by: Ignacio Talavera \r\n\r\n* Update docs/sections/faq.md\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* Update docs/sections/faq.md\r\n\r\nCo-authored-by: Alvaro Bartolome \r\n\r\n* refactor file name for explicitness (#641)\r\n\r\n* Fix file name to render docs (#642)\r\n\r\n* Update `docs/*.md` and `mkdocs.yml` (WIP)\r\n\r\n* Update `docs/*.md` and `mkdocs.yml` (WIP)\r\n\r\n* Run `codespell` to fix typos\r\n\r\n* Collapse examples to simplify reading (#643)\r\n\r\n* Avoid long titles in `nav`\r\n\r\n* Update `docs/*.md` and `mkdocs.yml` (WIP)\r\n\r\n* Update `Task Gallery` API reference\r\n\r\n* Remove `prometheus.md` as not written yet\r\n\r\n---------\r\n\r\nCo-authored-by: plaguss \r\nCo-authored-by: Ignacio Talavera \r\n\r\n* Export components info & components gallery in docs (#640)\r\n\r\n* Refactor to `classmethod`s\r\n\r\n* Add `distilabel/components-gallery` mkdocs plugin\r\n\r\n* Update to exclude `self` parameter\r\n\r\n* Update docstrings\r\n\r\n* Fix parsing short and long description\r\n\r\n* Remove `self`\r\n\r\n* Revert \"Refactor to `classmethod`s\"\r\n\r\nThis reverts commit 3384de99472a600fc9e1815cf5013ede974d599e.\r\n\r\n* Remove exporting runtime parameters info with method\r\n\r\n* Remove leading whitespaces\r\n\r\n* Add GitHub icon in docs\r\n\r\n* Finish component-gallery\r\n\r\n* Update docstrings\r\n\r\n* Update `parse_google_docstring` unit tests\r\n\r\n* Update sections and fix warnings\r\n\r\n* Deploy `dev` version from `develop` branch\r\n\r\n* Add hide toc\r\n\r\n* Update `Format{Chat,Text}Generation{DPO,SFT}`\r\n\r\n* Fix wrong import in `step_gallery/extra.md`\r\n\r\n* Documentation updates (#646)\r\n\r\n* Include section for Note in gallery\r\n\r\n* Fix rendering\r\n\r\n* Fix step to llm in jinja template\r\n\r\n* Updated layout for examples/papers section and included default page for Learn section\r\n\r\n* Update unit tests to take into account the note section from docstrings\r\n\r\n* Nest components from steps and tasks gallery into its parent section\r\n\r\n* Refactor docs 1.1.0 (#650)\r\n\r\n* Remove redundant information of available objects, those are in the components gallery now\r\n\r\n* Remove more redundant documentation of available steps and tasks\r\n\r\n* Reintroduce input/output_mappings in steps\r\n\r\n* Add reference to runtime parameters\r\n\r\n* Fix routing batch function deadlocks and unordered batches (#649)\r\n\r\n* Add checking step `input_batch_size` multiple\r\n\r\n* Fix unordered batches when using `routing_batch_function`\r\n\r\n* Fix `can_generate` condition\r\n\r\n* Remove metadata and style\r\n\r\n* Fix getting data for batch when irregular batch sizes\r\n\r\n* Fix steps receiving routed batches getting stuck\r\n\r\n* Fix `_last_batch_convergence_step` method\r\n\r\n* Fix stop not checking for `None`\r\n\r\n* Fix issues related to the queues\r\n\r\n* Remove unused variable\r\n\r\n* Add integration tests timeout\r\n\r\n* Fix deadlock caused becase next expected batch in convergence step\r\n\r\n* Update unit tests\r\n\r\n* Add timeout to tests\r\n\r\n* Simplify condition\r\n\r\n* Fix unit test\r\n\r\n* Update timeouts\r\n\r\n---------\r\n\r\nCo-authored-by: Agus \r\nCo-authored-by: Gabriel Martín Blázquez \r\nCo-authored-by: Gabriel Martin \r\nCo-authored-by: Krishna Tripathi \r\nCo-authored-by: plaguss \r\nCo-authored-by: Ignacio Talavera ","shortMessageHtmlLink":"distilabel v1.1 (#559)"}},{"before":"bfa84844dca22188adb56085b39ad60b2865ed52","after":"e349cd427f1cc246dbfcc62c313cfdb42758ada7","ref":"refs/heads/gh-pages","pushedAt":"2024-05-20T13:24:49.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed 690013a to dev with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed 690013a to dev with MkDocs 1.6.0 and mike 2.1.1"}},{"before":"31307d550cd87d6da6f7a65467ccda9ecb3405ce","after":null,"ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-20T13:23:34.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"}},{"before":"4ea1fc0875094257937a2d7ca75bf8c4e01a31fe","after":"690013ada15950322eead35f68dbbbe2d149ee62","ref":"refs/heads/develop","pushedAt":"2024-05-20T13:23:33.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix routing batch function deadlocks and unordered batches (#649)\n\n* Add checking step `input_batch_size` multiple\r\n\r\n* Fix unordered batches when using `routing_batch_function`\r\n\r\n* Fix `can_generate` condition\r\n\r\n* Remove metadata and style\r\n\r\n* Fix getting data for batch when irregular batch sizes\r\n\r\n* Fix steps receiving routed batches getting stuck\r\n\r\n* Fix `_last_batch_convergence_step` method\r\n\r\n* Fix stop not checking for `None`\r\n\r\n* Fix issues related to the queues\r\n\r\n* Remove unused variable\r\n\r\n* Add integration tests timeout\r\n\r\n* Fix deadlock caused becase next expected batch in convergence step\r\n\r\n* Update unit tests\r\n\r\n* Add timeout to tests\r\n\r\n* Simplify condition\r\n\r\n* Fix unit test\r\n\r\n* Update timeouts","shortMessageHtmlLink":"Fix routing batch function deadlocks and unordered batches (#649)"}},{"before":"5bb87ff19f538fd1e745126bbfa5e7ce612591a9","after":"31307d550cd87d6da6f7a65467ccda9ecb3405ce","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-20T13:05:15.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Update timeouts","shortMessageHtmlLink":"Update timeouts"}},{"before":"31492bc4015b8e77cf2c740da5acfd29798b41af","after":"bfa84844dca22188adb56085b39ad60b2865ed52","ref":"refs/heads/gh-pages","pushedAt":"2024-05-20T12:56:21.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed 4ea1fc0 to dev with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed 4ea1fc0 to dev with MkDocs 1.6.0 and mike 2.1.1"}},{"before":"f3718551dbdf86038b66fba7fe64f6fa68fde96e","after":null,"ref":"refs/heads/refactor-docs-1.1.0","pushedAt":"2024-05-20T12:55:08.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"}},{"before":"cd9148c38c35003322829eddfa2acded70ae69cf","after":"4ea1fc0875094257937a2d7ca75bf8c4e01a31fe","ref":"refs/heads/develop","pushedAt":"2024-05-20T12:55:07.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Refactor docs 1.1.0 (#650)\n\n* Remove redundant information of available objects, those are in the components gallery now\r\n\r\n* Remove more redundant documentation of available steps and tasks\r\n\r\n* Reintroduce input/output_mappings in steps\r\n\r\n* Add reference to runtime parameters","shortMessageHtmlLink":"Refactor docs 1.1.0 (#650)"}},{"before":"d64b9529ed81365c1fff7f1b7dce227427a3a783","after":"5bb87ff19f538fd1e745126bbfa5e7ce612591a9","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-20T12:54:19.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix unit test","shortMessageHtmlLink":"Fix unit test"}},{"before":"2340e42ef989f98d32c5a05e46fa39f695f73d58","after":"d64b9529ed81365c1fff7f1b7dce227427a3a783","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-20T12:39:32.000Z","pushType":"push","commitsCount":4,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Simplify condition","shortMessageHtmlLink":"Simplify condition"}},{"before":"4883074dacbb9281b482aac18082dc61c3c7bae2","after":"f3718551dbdf86038b66fba7fe64f6fa68fde96e","ref":"refs/heads/refactor-docs-1.1.0","pushedAt":"2024-05-20T08:45:31.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Add reference to runtime parameters","shortMessageHtmlLink":"Add reference to runtime parameters"}},{"before":null,"after":"4883074dacbb9281b482aac18082dc61c3c7bae2","ref":"refs/heads/refactor-docs-1.1.0","pushedAt":"2024-05-20T08:02:17.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Reintroduce input/output_mappings in steps","shortMessageHtmlLink":"Reintroduce input/output_mappings in steps"}},{"before":"ebf4fdac5f545b98572a61c86b5fef8ac7e6b4cf","after":"2340e42ef989f98d32c5a05e46fa39f695f73d58","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-18T12:26:59.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Add integration tests timeout","shortMessageHtmlLink":"Add integration tests timeout"}},{"before":"0fb457f94e3c21bafccbc2b9508486c5025341ab","after":"ebf4fdac5f545b98572a61c86b5fef8ac7e6b4cf","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-18T11:23:53.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Remove unused variable","shortMessageHtmlLink":"Remove unused variable"}},{"before":"6130ebc6b7a61012d5dccbc42527a74275700433","after":"0fb457f94e3c21bafccbc2b9508486c5025341ab","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-18T11:12:07.000Z","pushType":"push","commitsCount":3,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix issues related to the queues","shortMessageHtmlLink":"Fix issues related to the queues"}},{"before":"1187f98a44bd78c8de1e39c8b5b2607a4fc087b4","after":"6130ebc6b7a61012d5dccbc42527a74275700433","ref":"refs/heads/fix-routing-batch-function-order","pushedAt":"2024-05-17T20:07:17.000Z","pushType":"push","commitsCount":3,"pusher":{"login":"gabrielmbmb","name":"Gabriel Martín Blázquez","path":"/gabrielmbmb","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/29572918?s=80&v=4"},"commit":{"message":"Fix steps receiving routed batches getting stuck","shortMessageHtmlLink":"Fix steps receiving routed batches getting stuck"}},{"before":"f1321dff138655d6e7b78fe3551ea2d6faafdba8","after":"31492bc4015b8e77cf2c740da5acfd29798b41af","ref":"refs/heads/gh-pages","pushedAt":"2024-05-17T14:45:02.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"github-actions[bot]","name":null,"path":"/apps/github-actions","primaryAvatarUrl":"https://avatars.githubusercontent.com/in/15368?s=80&v=4"},"commit":{"message":"Deployed cd9148c to dev with MkDocs 1.6.0 and mike 2.1.1","shortMessageHtmlLink":"Deployed cd9148c to dev with MkDocs 1.6.0 and mike 2.1.1"}},{"before":"a26aeff8cfa1d1dbe8e0a67a03dac40e106040bf","after":null,"ref":"refs/heads/docs-touches","pushedAt":"2024-05-17T14:43:46.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"}},{"before":"3bac1ef6339fc86bf42c896f310520cdefc92e1b","after":"cd9148c38c35003322829eddfa2acded70ae69cf","ref":"refs/heads/develop","pushedAt":"2024-05-17T14:43:45.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Documentation updates (#646)\n\n* Include section for Note in gallery\r\n\r\n* Fix rendering\r\n\r\n* Fix step to llm in jinja template\r\n\r\n* Updated layout for examples/papers section and included default page for Learn section\r\n\r\n* Update unit tests to take into account the note section from docstrings\r\n\r\n* Nest components from steps and tasks gallery into its parent section","shortMessageHtmlLink":"Documentation updates (#646)"}},{"before":"da7e525761c779485ccecc31ef4a326bc6430de9","after":"a26aeff8cfa1d1dbe8e0a67a03dac40e106040bf","ref":"refs/heads/docs-touches","pushedAt":"2024-05-17T10:26:06.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Nest components from steps and tasks gallery into its parent section","shortMessageHtmlLink":"Nest components from steps and tasks gallery into its parent section"}},{"before":"35a03f3e0e7aa29b19db924047a3735a4417c162","after":"da7e525761c779485ccecc31ef4a326bc6430de9","ref":"refs/heads/docs-touches","pushedAt":"2024-05-17T10:20:49.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Update unit tests to take into account the note section from docstrings","shortMessageHtmlLink":"Update unit tests to take into account the note section from docstrings"}},{"before":"d323c6d89377caf3fbc3b4c5621307c05f373898","after":"35a03f3e0e7aa29b19db924047a3735a4417c162","ref":"refs/heads/docs-touches","pushedAt":"2024-05-17T10:15:53.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"plaguss","name":"Agus","path":"/plaguss","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/56895847?s=80&v=4"},"commit":{"message":"Updated layout for examples/papers section and included default page for Learn section","shortMessageHtmlLink":"Updated layout for examples/papers section and included default page …"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAETuDQVgA","startCursor":null,"endCursor":null}},"title":"Activity · argilla-io/distilabel"}