-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kafka group.id to manage the number of workers Bytewax Docker #377
Comments
Hi rafael. The connector does not let kafka handle the offset with the The nice thing is that you don't need to use The warning is related to the fact the we pass the same config used in the consumer to an AdminClient we use when bootstrapping the dataflow to retrieve the list of partitions for each topic (see here). That client is seen as a "producer", so you get the warning, but you can safely ignore that. |
Thanks for the response. @Psykopear If I understand correctly, I will have to modify the entrypoint.sh used by the Image bytewax/bytewax:0.18.1-python3.11 #!/bin/sh
cd $BYTEWAX_WORKDIR
. /venv/bin/activate
python -m bytewax.run -w$NUM_WORKERS $BYTEWAX_PYTHON_FILE_PATH
echo 'Process ended.'
if [ "$BYTEWAX_KEEP_CONTAINER_ALIVE" = true ]
then
echo 'Keeping container alive...';
while :; do sleep 1; done
fi NUM_WORKERS is the known number of partitions of a given topic. so In a given Dockerfile I will just need to FROM bytewax/bytewax:0.18.1-python3.11
ENV NUM_WORKERS=10
COPY entrypoint.sh /bytewax/entrypoint.sh |
You shouldn't need to modify the entrypoint, as most of these configuration options already accept environment variables. As an example, you can set the number of workers per process using the environment variable |
Bug Description
I am building a containerized application of Bytewax and Kafka.
I am expecting to manage the parallelism with the group.id, the topic has a partition of 10, so I can have 10 replicas of the worker.
I was expecting to see different result by worker but all return the same value (meaning that they are not using consumer group
tracking the value I found this message:
If I am setting up a input (consumer) Why I am receiving this messages. Is this the right approach to use bytewax with Kafka? Inst the kop.input a consumer?
Python version (
python -V
)Python 3.11
Bytewax version (pip list | grep bytewax)
bytewax 18,1
Operating System version (
uname -morp
)FROM bytewax/bytewax:0.18.1-python3.11
Relevant log output
Steps to Reproduce
(To reproduce exactly the problem we will have to setup the env)
The text was updated successfully, but these errors were encountered: