ElasticSearch IK Tokenizer Quick Start

ElasticSearch IK Tokenizer Quick Start

1. Install IK word segmenter

1. Allocate a pseudo terminal

My ElasticSearch is installed using Docker, so first assign a pseudo terminal to the container. Then you can directly operate the contents in Docker just like logging into the server.
docker exec -it container ID /bin/bash

2. Install the plugin using elasticsearch-plugin

cd plugins Enter the plugins folder and execute the following command. 7.12.0 needs to be changed to your es version number

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip

3. Restart the elasticsearch container

docker restart container id

4. Frequently Asked Questions

(1)java.lang.IllegalStateException

When executing elasticsearch-plugin install, Failed installing and java.lang.IllegalStateException appeared

I checked and found that there is already a folder named ik in my plugins/ directory.
With a try-it attitude, I deleted the folder named ik and re-executed

../bin/elasticsearch-plugin install https://github.com/medcl/elasticsearch-analysis-ik/releases/download/v7.12.0/elasticsearch-analysis-ik-7.12.0.zip

Installation Successful

2. ik word segmentation principle

(Placeholder, I will add more when I have time to read the source code)

3. ik_smart and ik_max_word

GET _analyze?pretty{ "analyzer": "ik_smart", "text": "Unlike me, I only feel sorry for pigeons"}

GET _analyze?pretty{ "analyzer": "ik_max_word", "text": "Unlike me, I only feel sorry for pigeons"}

in conclusion

_analyzer builds the index when the word segmentation, when the index uses max_word
search_analyzer uses word segmentation when searching and smart when querying

<<:  Development is accelerating again, and 5G plans and goals for major provinces and cities in my country are released!

>>:  Using Jenkins to create continuous integration for microservice applications

Recommend

Improving efficiency and reliability using SDN in multi-layer networks

Abstraction is a big issue in Software Defined Ne...

Regarding the ocean, we actually have a choice...

There are ten thousand ways for us to live in pea...

Top 10 Web Trends: How to Stay Ahead in 2021

[[422976]] 【51CTO.com Quick Translation】 Due to t...

5G messaging is entering a development phase

As one of the earliest attempts at 5G commercial ...

...

How will the emergence of 5G affect AI federated learning?

As development teams scramble to build AI tools, ...

...

5G brings unlimited opportunities to different industries

The workplace of tomorrow will look different tha...

Let’s talk about 6G development

As of now, the fifth generation (5G) mobile commu...

IPv4 scarcity threatens Internet development

RIPENCC, the regional internet registry for Europ...

UDP, you need to feed the mouse!

[[353775]] The transport layer is located between...