ElasticSearch IK Tokenizer Quick Start

ElasticSearch IK Tokenizer Quick Start

1. Install IK word segmenter

1. Allocate a pseudo terminal

My ElasticSearch is installed using Docker, so first assign a pseudo terminal to the container. Then you can directly operate the contents in Docker just like logging into the server.
docker exec -it container ID /bin/bash

2. Install the plugin using elasticsearch-plugin

cd plugins Enter the plugins folder and execute the following command. 7.12.0 needs to be changed to your es version number

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip

3. Restart the elasticsearch container

docker restart container id

4. Frequently Asked Questions

(1)java.lang.IllegalStateException

When executing elasticsearch-plugin install, Failed installing and java.lang.IllegalStateException appeared

I checked and found that there is already a folder named ik in my plugins/ directory.
With a try-it attitude, I deleted the folder named ik and re-executed

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip

Installation Successful

2. ik word segmentation principle

(Placeholder, I will add more when I have time to read the source code)

3. ik_smart and ik_max_word

GET _analyze?pretty{ "analyzer": "ik_smart", "text": "Unlike me, I only feel sorry for pigeons"}

GET _analyze?pretty{ "analyzer": "ik_max_word", "text": "Unlike me, I only feel sorry for pigeons"}

in conclusion

_analyzer builds the index when the word segmentation, when the index uses max_word
search_analyzer uses word segmentation when searching and smart when querying

<<:  Development is accelerating again, and 5G plans and goals for major provinces and cities in my country are released!

>>:  Using Jenkins to create continuous integration for microservice applications

Blog    

Recommend

In addition to 404, what other "codes" are there for web pages?

404 Not Found When surfing the Internet, whether ...

Diagram: 5G millimeter wave peak rate calculation

[[390044]] This article is reprinted from the WeC...

Managing a growing API portfolio

We have previously discussed the importance of AP...

5G - the future network technology for all applications

As 5G is being promoted and deployed around the w...

EtherNetservers: $12/year-1GB/30GB/2TB/2IP/Los Angeles

EtherNetservers is a foreign VPS hosting company ...