在做导入的时候导入重复了,现在根据‘商品id’和'来源'两个字段能确定出来重复的数据。但是怎么清楚掉不知道怎么办,求助大佬。
POST search_product/_search
{
"size": 0,
"aggs": {
"duplicateCount": {
"terms":{
"script": "doc['productFromType.keyword'].value + '#' + doc['shopProductId'].value + '#' ",
"min_doc_count": 2
},
"aggs": {
"duplicateDocuments": {
"top_hits": {}
}
}
}
}
}
POST search_product/_search
{
"size": 0,
"aggs": {
"duplicateCount": {
"terms":{
"script": "doc['productFromType.keyword'].value + '#' + doc['shopProductId'].value + '#' ",
"min_doc_count": 2
},
"aggs": {
"duplicateDocuments": {
"top_hits": {}
}
}
}
}
}
1 个回复
AiToMaKoTo - Elasticsearch.永远滴神
赞同来自: