New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How it is different with Datafuse? #207
Comments
@alexey-milovidov Alex! Really appreciated for your great wishes from ClickHouse team!
In terms of general product strategy, there are many information not released indeed. From the detailed tech implementation, ClickHouse(CH, for short), of course, is our great teacher. But, furthermore, TensorBase(TB, for short) also wants to systematically correct the problems shown in ClickHouse and from the view of my seven-year experiences of bigdata system development and operation.
Datafuse(DFe, for short) is a sad topic. I am used to invite the team of DFe several months ago. But they've got good memory from local ventures. The invitation was not accepted. Before this invitation, the project is licensed under a commercially unfriendly license AGPL. Personally, I do not lookup any source of commercially unfriendly licensed project. Before this invitation, I have built one prototype of pure Rust engine based on the Arrow and DataFusion. And it works greatly and become the new initial base of TB as you seen. After the time of invitation, I evaluated the DFe which has been changed to the APL licence. The conclusion is, basically there is nothing new in DFe except for a good engineering structure (but a nice engineering structure is a low-hanging fruit in my humble opinion):
In TensorBase, we have a heart of change. This is the fundamental difference to DFe from TB.
Back to the first question, TensorBase, in fact, has pinned a very different way to ClickHouse and of course to CH's clones :
Several local ventures ask me a question: if DFe and similar projects copy your codes, then how about you? I have no idea. Your CH has been copied everywhere but it has Yandex backed. This is not the case for TB. So, for your first question, my current answer is: we reveal them when they come. It is hoped that, if they copy, the world knows that things are copied from TB. Finally, I am not sure, if there is an opportunity to work with the Yandex/CH team in the next-generation data warehouse from the new engineering of TB. TB now is a good friend in CH ecosystem. TB is open for any possibility. |
@alexey-milovidov
@jinmingjian Comments are cheap, show me the product and solve actual problem! |
@jinmingjian Raft is the choice right now only because it provides a well defined engineering architecture, and is easy to use for building a prototype, not because it has any advantages over any of other consensus protocols. We've been working on something new that adapts to large scale cross-DC and cross-cloud deployment. I can't agree more that repeating something is quite boring. Creating is the only thing interested me. And we can't wait to share with the community about what we created, what we did right and what we did wrong. |
Hi, @BohuTANG , take it easy:) My comment is truly biased on my understanding. Alex or more people asks this problem because they feel to find common things. As the author, I explain the difference from my view. So, this is just my own opinion. I may do not understand what you are doing. But modern, real-time, data, analytics, cloud are also what TB is pursuing. We are sure to have our own understanding on these. It is no problem to have two or more open source projects on one direction. And the authors have the rights to decide how to build their projects. I do not say you are wrong. I just say I do not do the things like yours. |
Thanks for sharing, raft is still good, and welcome your new protocol:) I just show the different thinkings of TB. But "comment is cheap". I just leave the answer for time.
good wishes for you:) |
Ok. I will just keep an eye on both of these projects. |
@jinmingjian BTW, I have tested Datafuse, |
@alexey-milovidov Alex, thanks for feedback and benchmark sharing! I have seen too many such things in the local community, unfair comparison is just one of them. These operations are only destructive to the open source community. As a member of entire open source community, I also hope everyone in the community could respect innovations to help rather than destruct the community. |
In terms of general product strategy, direction?
These projects look very similar, I'm wondering how do they differ.
PS. Best wishes from ClickHouse team :)
The text was updated successfully, but these errors were encountered: