Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[inverted index]In English and Chinese, xx match '' returns different results. #45403

Open
chengqianli-git opened this issue May 10, 2024 · 0 comments
Assignees
Labels
type/bug Something isn't working

Comments

@chengqianli-git
Copy link

chengqianli-git commented May 10, 2024

Steps to reproduce the behavior (Required)

  1. CREATE TABLE '...'
  2. INSERT INTO '....'
  3. SELECT '....'4. 4.
CREATE TABLE `duplicate_table_demo_datatype_not_replicated_all_varchar` (
  `AAA` datetime NOT NULL COMMENT "",
  `BBB` varchar(200) NOT NULL COMMENT "",
  `CCC` varchar(200) NOT NULL COMMENT "",
  `DDD` varchar(2000) NULL COMMENT "",
  `EEE` largeint(40) NULL COMMENT "",
  `FFF` decimal(20, 10) NULL COMMENT "",
  `GGG` varchar(200) NULL COMMENT "",
  `HHH` float NULL COMMENT "",
  `III` boolean NULL COMMENT "",
  `KKK` char(20) NULL COMMENT "",
  `LLL` varchar(65533) NULL COMMENT "",
  `MMM` varchar(20) NULL COMMENT "",
  `NNN` varbinary NULL COMMENT "",
  `OOO` tinyint(4) NULL COMMENT "",
  `PPP` datetime NULL COMMENT "",
  `QQQ` array<int(11)> NULL COMMENT "",
  `RRR` json NULL COMMENT "",
  `SSS` map<int(11),int(11)> NULL COMMENT "",
  `TTT` struct<a int(11), b int(11)> NULL COMMENT "",
  INDEX init_bitmap_index (`KKK`) USING BITMAP COMMENT '',
  INDEX idx (`DDD`) USING GIN("parser" = "chinese") COMMENT ''
) ENGINE=OLAP
DUPLICATE KEY(`AAA`, `BBB`, `CCC`)
PARTITION BY RANGE(`AAA`)
(PARTITION p1970 VALUES [("1970-01-01 00:00:00"), ("2000-01-01 00:00:00")),
PARTITION p2000 VALUES [("2000-01-01 00:00:00"), ("2030-01-01 00:00:00")))
DISTRIBUTED BY HASH(`AAA`, `BBB`) BUCKETS 3
ORDER BY(`AAA`, `BBB`, `CCC`, `DDD`)
PROPERTIES (
"bloom_filter_columns" = "MMM",
"compression" = "LZ4",
"fast_schema_evolution" = "true",
"replicated_storage" = "false",
"replication_num" = "3",
"unique_constraints" = "default_catalog.test_inverted_indexa4a35500_0e77_11ef_b07c_00163e21975a.duplicate_table_demo_datatype_not_replicated_all_varchar.GGG"
);

insert five data,
select * from duplicate_table_demo_datatype_not_replicated_all_varchar where DDD match '';

Expected behavior (Required)

Real behavior (Required)

return 1 row
but if parser is English, return 0 row

StarRocks version (Required)

  • You can get the StarRocks version by executing SQL select current_version()
@chengqianli-git chengqianli-git added the type/bug Something isn't working label May 10, 2024
@chengqianli-git chengqianli-git changed the title [inverte index]In English and Chinese, xx match '' returns different results. [inverted index]In English and Chinese, xx match '' returns different results. May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants