Community Daily, Issue 1577 (2023-02-22)

1. Exploring | What is the difference between ctx, doc, and _source in Elasticsearch Painless scripts?
https://mp.weixin.qq.com/s/ibk78SQw8JHuDUq5ZCr_8w
2. An introduction to the Keyword/Fingerprint/Pattern analyzers in ES (VPN required)
https://mkonda007.medium.com/e ... 4801e
3. Elasticsearch: Implementing and using a sentiment analyzer in a satisfaction survey
https://blog.csdn.net/UbuntuTo ... 20283

Editor: kin122
Archive: https://ela.st/cn-daily-all
Subscribe: https://ela.st/cn-daily-sub
Meetup: https://ela.st/cn-meetup
Bilibili: https://ela.st/bilibili

Community Daily, Issue 1576 (2023-02-21)

1. Bucket aggregations from beginner to expert
https://medium.com/%40dzenan.d ... 871f0

2. Build a search engine with Elastic Cloud? So easy
https://medium.com/%40charukar ... 22cdb

3. Should search rely on text or vectors?
https://towardsdatascience.com ... 6132a

Editor: 斯蒂文

Community Daily, Issue 1575 (2023-02-20)

1. Optimizing query results with Function Score
   http://www.scienjus.com/elasti ... uery/
2. Why Elasticsearch is so fast
   https://www.jianshu.com/p/b50d7fdbe544/
3. Elasticsearch concurrency control and how optimistic locking is implemented
   https://zhuanlan.zhihu.com/p/95460292
Editor: yuebancanghai

Community Daily, Issue 1574 (2023-02-16)

1. Analyzing Elasticsearch queries with the Profile API and slow logs
https://coralogix.com/blog/imp ... logs/
2. An introduction to BooleanQuery
https://www.amazingkoala.com.c ... .html
3. Elasticsearch Ingest Pipeline 101
https://hevodata.com/learn/ela ... %23t7

Editor: Se7en

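Community Daily, Issue 1573 (2023-02-15)

1. Tencent Cloud Big Data ES: a deep dive into Lucene compression encoding optimizations
https://mp.weixin.qq.com/s/eIy1Tv1Teonl2HWtvPVUZg
2. How ES saves cost through storage efficiency
https://www.elastic.co/cn/blog ... -7-10
3. Elasticsearch: Using decay functions (Gauss) in search
https://blog.csdn.net/UbuntuTo ... 55263

Editor: kin122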

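Community Daily, Issue 1572 (2023-02-14)

Happy Valentine's Day, everyone~
1. A small function_score example (VPN required)
https://medium.com/%40andre.lu ... e07a1
2. Syncing data from MySQL to Elasticsearch (VPN required)
https://towardsdatascience.com ... 7b339
3. How we used ES to revamp a 21-year-old XX system (VPN required)
https://medium.com/%40s_nikola ... e4551

Editor: 斯蒂文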

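Community Daily, Issue 1571 (2023-02-13)

1. Hot-updating custom dictionaries in Elasticsearch
   https://www.cnblogs.com/fengwe ... .html
2. Applying a search service to in-app search scenarios
   https://bbs.huaweicloud.com/blogs/114503
3. Chinese character autocompletion and spelling correction in Elasticsearch
   https://it.cha138.com/mysql/show-86965.html
Editor: yuebancanghai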

Community Daily, Issue 1570 (2023-02-10)

1. Migrating data from Elasticsearch to Microsoft Azure Data Explorer (ADX) with Logstash
https://techcommunity.microsof ... 22397
2. PostgreSQL full-text search and its applications
https://dev.to/thegnarco/postg ... h-f5c
3. Managing indices in a time-series fashion
https://dev.to/sandeepkanabar/ ... -1ebl

Editor: 铭毅天下

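INFINI Gateway and Console updates released!

INFINI Labs product update release

Hi everyone. Today INFINI Labs brings you the first wave of product releases after the 2023 Spring Festival. You are welcome to download and try them for free.

INFINI Gateway v1.9.0

This iteration of INFINI Gateway (极限网关) brings a large number of updates, listed below: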

Breaking changes

  • Refactoring config for ip access control
  • Disable elasticsearch metadata refresh by default
  • Update default config path from configs to config
  • Remove sample-configs, moved to dedicated integrated-testing project
  • Remove field conntime, update field @timestamp to timestamp in logging filter
  • Rename disorder to fast

Features

  • Support listening on IPv6 addresses
  • Add general health API
  • Add request_ip to context
  • Add badger filter plugin
  • Allow splitting the producing and consuming of messages from s3
  • Add bulk_request_throttle filter
  • Support accessing the request context and more output options in echo filter
  • Add body_json to response context
  • Add cert config to API module, support mTLS
  • Add API to clear scroll context
  • floating_ip supports sticking by priority
  • Add keystore util
  • Allow saving successful bulk results in bulk_indexing processor
  • Enable watching and reloading the major config file
  • Support running background jobs in one goroutine
  • Allow handling async_bulk request logging
  • Add config to control cluster health check while the cluster is not available, default set to false
  • Allow following redirects in http filter, set default read and write timeout to 30s
  • Support collecting instance metrics for gateway monitoring
  • Add JSON log format

Bug fix

  • Fix user being removed in logging filter
  • Fix incorrect message size issue, reload when files change in disk_queue
  • Fix issue where index_diff could not finish automatically
  • Fix hostname not being properly updated in filter set_request_header or set_hostname
  • Fix to check consumer’s lag instead of queue’s lag in flow_runner processor
  • Fix file not found error for disk_queue
  • Fix delete requests not being properly handled in filter bulk_reshuffle, bulk_request_mutate and bulk_indexing processor
  • Fix memory leak caused by misuse of bytes buffer
  • Fix to handle the last request in replay processor
  • Fix URL args not being updated after change
  • Fix memory leak when serving highly concurrent requests
  • Fix error caused by nil id when using sliced workers in bulk_indexing processor
  • Fix index name with dot
  • Refactoring time fields for orm, skip empty time
  • Refactoring stats, allow registering extended stats
  • Fix to restart gateway entrypoint on flow change
  • Update ratio filter, fix random number, add header to ratio filter
  • Fix query parameter no_cache not being respected in get_cache filter
  • Fix single delete request being ignored in bulk requests
  • Fix request mutate filter

Improvements

  • Remove newline in indexing_merge and json_indexing processor
  • Improve instance check, add config to disable
  • Add option skip_insecure_verify to s3 module
  • Improve instance check, enable config to disable
  • Update the way to get ctx process info, optimize memory usage
  • Improve indexing performance for bulk_indexing processor
  • Refactoring disk_queue, speed up message consumption
  • Enable segment compression for disk_queue by default
  • Skip downloading s3 files when s3 is not enabled
  • Add option to log warning messages for throttle filters
  • Optimize hash performance for getting primary shardID and partitionID
  • Add cache for getting index routing table
  • Optimize performance for bulk response processing
  • Refactoring bulk_processor, pass meta info to payload func
  • Don’t call payload func for delete action
  • Improve queue consumer’s lag check
  • Enable preparing flat files ahead of reads by default, skip unnecessary files
  • Add object pool for xxhash
  • Refactoring disk_queue, handle consumer in-flight segments in memory
  • Add config to remove duplicated newline for bulk_processor
  • Add metric timestamp in stats API
  • Improve error when routing table is missing
  • Refactoring bytes buffer and object pool, expose metrics via API
  • Refactoring tasks pooling, support throttle and unified control
  • Optimize badger file size and memory usage
  • Refactoring time fields for orm, skip empty time
  • Refactoring stats, allow registering extended stats
  • Refactoring to handle bulk response results
  • Add client_session_cache_size to tls setting
  • Safely add newline to each bytes when handling bulk requests

INFINI Console v0.7.0

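This iteration of INFINI Console includes the following updates:

  • Added an initial setup wizard;
  • Added system service health monitoring;
  • Added License authorization;
  • Added data write throughput metrics in bytes at the index and node levels (indexing bytes);
  • Fixed an issue where Discover did not issue a search request on first load;
  • Fixed an issue where metrics were not displayed after selecting multiple nodes when viewing node thread pool metrics.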

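Feedback welcome

You are welcome to download and try the products. If you run into any questions or problems while using them, please submit a Feature Request or a Bug in the corresponding project on the INFINI Labs GitHub (https://github.com/infinilabs).

You can also contact us by email: hello@infini.ltd

Or call our hotline: (+86) 400-139-9200

You are also welcome to add the WeChat assistant (INFINI-Labs) to join the group for discussion and learning.

Thanks for reading, and have a great weekend.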


Community Daily, Issue 1569 (2023-02-09)

1. Troubleshooting Elasticsearch cluster anomalies
https://cloud.tencent.com/docu ... 56281
2. 42 Elasticsearch query examples
https://coralogix.com/blog/42- ... rial/
3. A case study in optimizing Elasticsearch queries (VPN required)
https://medium.com/%40pawansin ... 3c99f

Editor: Se7en

Community Daily, Issue 1568 (2023-02-08)

1. Elasticsearch memory explained in detail
https://developer.aliyun.com/article/979413
2. Logstash: Testing Logstash pipelines/filters before deployment
https://blog.csdn.net/UbuntuTo ... 09116
3. Best practices for ES client sniffing
https://www.elastic.co/cn/blog ... y-how

Editor: kin122

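Community Daily, Issue 1567 (2023-02-07)

1. Connecting the data pipeline, from CSV to ES (VPN required)
https://medium.com/%40sumukhi. ... d4c99

2. A complete guide to ES security settings on k8s (VPN required)
https://blog.searce.com/deploy ... 1ddf5

3. ES synonyms and near-synonyms explained in detail (VPN required)
https://medium.com/version-1/s ... ba8a5

Editor: 斯蒂文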

Community Daily, Issue 1566 (2023-02-06)

1. Elasticsearch internals: the data model
   https://zhuanlan.zhihu.com/p/34680841
2. ES in practice: collecting Nginx logs for visual monitoring
   https://www.jianshu.com/p/66210e4a6ee8
3. Elasticsearch internals: the write path
   https://zhuanlan.zhihu.com/p/34875310
Editor: yuebancanghai

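[Referral] [Huawei Cloud] Elasticsearch Kernel Development Engineer

Responsibilities:
1. Take part in building cloud search engines and search systems that handle trillions of records
2. Take part in technical breakthroughs in algorithms, indexing, and system architecture that challenge industry limits
3. Develop vector retrieval algorithms
4. Develop solutions that combine recommendation and search

Preferred qualifications:
1. Familiarity with search engines or vector retrieval; experience with index structures, engine architecture, or distributed engines preferred
2. Familiarity with Elasticsearch and Lucene preferred
3. Familiarity with vector retrieval algorithms
4. Familiarity with recommendation system engineering and algorithms

Location: Shenzhen / Xi'an

Contact: niklaus.xiao@huawei.com
Phone/WeChat: 17688760968

About the team:
The Cloud Search Service team works on in-house search engine development and on kernel optimization of open-source components such as Elasticsearch and Lucene.
While pursuing market expansion, the team focuses on the technical breakthroughs the market needs, and has built industry-leading technology in areas such as vector retrieval, ultra-low cost, index build performance, and personalization.
We hope to find like-minded people to work with us on broadening the applications of search technology and continually pushing technical limits. The team will take responsibility for your technical and career growth.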

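Hiring: Search Engine Kernel R&D Engineer (Rust)

Hello everyone, happy return to work and happy Year of the Rabbit. The site admin is here to recruit teammates: join us to build the next-generation real-time search engine.

Responsibilities
  • Design and develop the next-generation real-time search engine;
  • Continuously optimize the implementation and improve component performance;
  • Ensure engineering quality and development efficiency.

Requirements
  • 3+ years of search engine development experience, a computer-related major, bachelor's degree or above;
  • Proficiency in one or more of Rust/C/C++/Golang; hands-on Rust development experience preferred;
  • Familiarity with the Linux operating system and its common commands, and the ability to write shell scripts;
  • Familiarity with Linux memory management and with low-latency, high-concurrency lock-free programming;
  • Familiarity with network protocols such as TCP/IP, Socket, and HTTP;
  • Good communication and teamwork skills;
  • Familiarity with common distributed algorithms; experience developing large-scale distributed systems preferred;
  • Good English reading and writing skills and strong logical thinking;
  • Good coding habits and technical documentation skills, with the ability to produce output consistently;
  • No restriction on work location.

Bonus points
  • Having your own blog, GitHub, or open-source projects preferred;
  • Relevant search engine development experience preferred;
  • Familiarity with various index structures;
  • Familiarity with LSM-Tree, B+Tree, RocksDB, LevelDB preferred;
  • Strong learning ability and willingness to dedicate yourself to researching new technologies.

For more information, visit the INFINI Labs (极限实验室) website: https://www.infinilabs.com/career/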