[Discuss][RoadMap]BitSail 2023Q1 RoadMap #275

Open
12 tasks
lichang-bd opened this issue Dec 23, 2022 · 2 comments

Comments

@lichang-bd
Member

lichang-bd commented Dec 23, 2022

Hi everyone,
The new year is coming. We look forward to working with you this year to build a better BitSail community and bring convenience to more data developers.
This issue is for discussing the BitSail roadmap for 2023 Q1. Feel free to share your ideas.

BitSail Connector

BitSail Basic Capacity building

BitSail Architecture Compatibility Improvement

BitSail Product Usability Optimization

  • Start integrating with open-source development platforms to provide front-end product pages
  • Explore more convenient, low-cost ways to integrate BitSail into existing systems, such as an API or SDK

BitSail Multi-Engine Architecture

  • Investigate multi-engine solutions and complete technical solution design
@zeliu
Contributor

zeliu commented Dec 27, 2022

Hi,
I have some ideas, just for reference:

1. We usually use a batch job to initialize the table first, and then a streaming job for incremental synchronization. Could a single BitSail job cover both phases and switch between them, using batch/streaming unification or some other mechanism?
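
To make the batch-then-stream handoff concrete, here is a minimal Python sketch. It is not BitSail code: the sync_table function, the (offset, key, value) record shape, and the snapshot_offset marker are all assumptions made for illustration. The idea is that the batch phase records the changelog position its snapshot corresponds to, and the streaming phase skips anything at or before that position.

```python
from typing import Dict, Iterable, Optional, Tuple

# Hypothetical change record: (offset, key, value); a value of None means delete.
Change = Tuple[int, str, Optional[object]]


def sync_table(
    snapshot: Dict[str, object],
    snapshot_offset: int,
    changelog: Iterable[Change],
) -> Dict[str, object]:
    """Load a snapshot (batch phase), then apply later changes (streaming phase)."""
    target = dict(snapshot)  # batch phase: full initial load
    for offset, key, value in changelog:
        if offset <= snapshot_offset:
            continue  # this change is already reflected in the snapshot
        if value is None:
            target.pop(key, None)  # delete
        else:
            target[key] = value  # insert or update
    return target
```

In a real job the changelog would be an unbounded stream rather than a finite list, so the second phase would keep running instead of returning.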

2. At present, BitSail readers and writers are one-to-one. Some requirements call for one-to-many: for example, a changelog may contain change records for multiple tables, and the writer side may be multiple Hudi tables.
To save computing resources, there are many scenarios where one job synchronizes many tables, so I think this feature is necessary.
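
As a rough illustration of the one-to-many case, the following Python sketch groups records from a single changelog by table so that each group can go to its own writer. It is not BitSail code: the route_by_table function and the "table" and "row" field names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def route_by_table(records: Iterable[dict]) -> Dict[str, List[dict]]:
    """Group changelog records by source table (hypothetical 'table' field)."""
    batches: Dict[str, List[dict]] = defaultdict(list)
    for record in records:
        # Each record carries the table it belongs to plus the changed row.
        batches[record["table"]].append(record["row"])
    return dict(batches)
```

Each key of the result would map to one target table, so a single reader could feed several writers inside one job.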

3. columns is a mandatory parameter in the reader and writer configuration. Could it be generated by querying metadata instead? In most cases the field names of the source and target are the same, and only some field types are converted. That way, tasks could be started from a temporary configuration file, which would reduce the work of maintaining a large number of configuration files.
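
A small Python sketch of what deriving columns from metadata could look like, using SQLite only because it ships with Python. The infer_columns function, the type_overrides parameter, and the name/type output shape are assumptions for illustration, not BitSail's actual API or configuration schema.

```python
import sqlite3
from typing import Dict, List, Optional


def infer_columns(
    conn: sqlite3.Connection,
    table: str,
    type_overrides: Optional[Dict[str, str]] = None,
) -> List[Dict[str, str]]:
    """Build a columns list from table metadata, with optional per-field type overrides."""
    overrides = type_overrides or {}
    # pragma_table_info is SQLite's table-valued form of PRAGMA table_info.
    rows = conn.execute(
        "SELECT name, type FROM pragma_table_info(?) ORDER BY cid", (table,)
    ).fetchall()
    return [
        {"name": name, "type": overrides.get(name, col_type.lower())}
        for name, col_type in rows
    ]
```

Only the fields whose types differ between source and target would need an entry in type_overrides; everything else comes from the metadata query.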

@Kick156

Kick156 commented Jan 6, 2023

Hello,
I have a suggestion, just for reference:
Regarding BitSail Product Usability Optimization, how about integrating with StreamPark? It is an easy-to-use framework for developing stream processing applications and a one-stop platform for operating them.
