Joining against a Kudu dimension table produces duplicate rows #346

Open
toratto opened this issue Aug 14, 2020 · 3 comments

Comments

@toratto

toratto commented Aug 14, 2020

--- Kudu dimension table DDL
CREATE TABLE side_rt_d_spu_sku_relation(
spu_id int,
sku_ids varchar,
PRIMARY KEY(spu_id),
PERIOD FOR SYSTEM_TIME
)WITH(
type ='kudu',
master ='xxxxxx',
tableName='d_goods_wlm_sku_rel',
cache ='LRU',
cacheSize ='10000',
cacheTTLMs ='60000',
parallelism ='1',
partitionedJoin='false'
);

@a49a

a49a commented Aug 21, 2020

We need a more detailed description. Please provide a minimal SQL example that reproduces the problem.

@aptx4869hrj

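We need a more detailed description. Please provide a minimal SQL example that reproduces the problem.

Do you have a concrete example?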

@Rowen110

If your dimension table or your fact table itself contains duplicate rows, you need to deduplicate it first and then do the join.
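
For illustration only, here is a minimal sketch of the "deduplicate first, then join" idea. It assumes an in-memory SQLite database rather than Flink and Kudu, and the tables `fact_orders` and `dim_spu_sku` and their rows are hypothetical, not taken from the reporter's job. It only shows how a duplicated key on the dimension side multiplies the joined rows, and how collapsing the dimension table to one row per `spu_id` avoids that.

```python
import sqlite3

# Hypothetical data in SQLite (an assumption; not the reporter's Flink/Kudu setup).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE fact_orders (order_id INTEGER, spu_id INTEGER);
    CREATE TABLE dim_spu_sku (spu_id INTEGER, sku_ids TEXT);
    INSERT INTO fact_orders VALUES (1, 100), (2, 200);
    -- spu_id 100 appears twice on the dimension side
    INSERT INTO dim_spu_sku VALUES (100, 'a,b'), (100, 'a,b'), (200, 'c');
""")

# Plain join: every duplicate dimension row yields an extra output row.
naive = conn.execute("""
    SELECT f.order_id, d.sku_ids
    FROM fact_orders f
    JOIN dim_spu_sku d ON f.spu_id = d.spu_id
    ORDER BY f.order_id
""").fetchall()

# Deduplicate the dimension side to one row per spu_id, then join.
deduped = conn.execute("""
    SELECT f.order_id, d.sku_ids
    FROM fact_orders f
    JOIN (
        SELECT spu_id, MAX(sku_ids) AS sku_ids
        FROM dim_spu_sku
        GROUP BY spu_id
    ) d ON f.spu_id = d.spu_id
    ORDER BY f.order_id
""").fetchall()

print(naive)    # [(1, 'a,b'), (1, 'a,b'), (2, 'c')]
print(deduped)  # [(1, 'a,b'), (2, 'c')]
```

Whether this matches the cause in this issue is not established in the thread; a minimal reproduction from the reporter would be needed to confirm it.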
