Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Encountered error in sinkFn - CANCELLED: client cancelled #1652

Open
nagarajatantry opened this issue Apr 8, 2024 · 2 comments
Open

Encountered error in sinkFn - CANCELLED: client cancelled #1652

nagarajatantry opened this issue Apr 8, 2024 · 2 comments
Labels
bug Something isn't working opex Operational Excellence to make it easy to run in production and debug
Milestone

Comments

@nagarajatantry
Copy link
Contributor

Update numaflow controller from rc1 to rc4. I see this error message in the sink vertex. Sink Pods remained in Running State.

Error in numa container

{"level":"error","ts":"2024-04-08T18:38:21.170233545Z","logger":"numaflow.Sink-processor","caller":"forward/forward.go:415","msg":"Retrying failed messages","pipeline":"kafka-test-pipeline-1","vertex":"custom-out","errors":{"gRPC client.SinkFn failed, failed to execute stream.Send(value:\"..."  event_time:{seconds:1712601333  nanos:777000000}  watermark:{seconds:-62135596800}  id:\"\\x00\\x00\\x00\\x00\\x00\\xe25\\xc5-input-0\"): rpc error: code = Internal desc = grpc: error while marshaling: string field contains invalid UTF-8":98},"pipeline":"kafka-test-pipeline-1","vertex":"custom-out","partition_name":"custom-out","stacktrace":"github.com/numaproj/numaflow/pkg/sinks/forward.(*DataForward).writeToBuffer\n\t/Users/yhl01/Documents/numaproj/numaflow/pkg/sinks/forward/forward.go:415\ngithub.com/numaproj/numaflow/pkg/sinks/forward.(*DataForward).forwardAChunk\n\t/Users/yhl01/Documents/numaproj/numaflow/pkg/sinks/forward/forward.go:271\ngithub.com/numaproj/numaflow/pkg/sinks/forward.(*DataForward).Start.func1\n\t/Users/yhl01/Documents/numaproj/numaflow/pkg/sinks/forward/forward.go:133"}

error in custom sink container

2024-04-08T18:38:21,173+0000-ERROR-"grpc-default-executor-0" -i.n.n.sinker.Service-68-Encountered error in sinkFn - CANCELLED: client cancelled 
@nagarajatantry nagarajatantry added the bug Something isn't working label Apr 8, 2024
@vigith
Copy link
Contributor

vigith commented Apr 8, 2024

This is because of the stale messages in the ISB. I am assuming that the error count should have spiked up and alerted the user. We should think of a better user experience?

@vigith vigith added the opex Operational Excellence to make it easy to run in production and debug label Apr 8, 2024
@vigith vigith added this to the 1.2 milestone Apr 8, 2024
@nagarajatantry
Copy link
Contributor Author

this was in a nonprod environment with very low tps, so it would have been difficult to catch with an alert. We may need a better way to detect from the platform perspective since the id field is managed internally by the platform

@whynowy whynowy modified the milestones: 1.2, 1.3 May 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working opex Operational Excellence to make it easy to run in production and debug
Projects
None yet
Development

No branches or pull requests

3 participants