elasticsearch jvm-mx meetup april 2016

Post on 11-Jan-2017

540 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

ElasticsearchMexico City JVM Group

April 2016

!Gracias por estar aqui¡

¡El meetup con mas asistencia de la historia!

What is the topic tonight?

Elastic search?

NO

Elastic Search?

NO

ElasticSearch?

NO

@superserch?

NO

Elasticsearch

YES

Search

Do I have to elaborate why search is important?

A little history…

Lucene History• Douglass Read "Doug" Cutting wrote Lucene in 1999

• Doug also is the author of Hadoop

• In Lucene other projects came to life

• Mahout

• Tika

• Nutch

Lucene?

• Lucene is an open-source Java full-text search library which makes it easy to add search functionality to an application or website.

Index

Query

Inverted index

• Lucene creates a data structure where it keeps a list of where each word belongs.

Lucene-Based projects

• Solr

• Compass

• Elasticsearch

• Hibernate search

ElasticsearchYou Know, for Search.

Features• Real-Time Data. I (Domingo) say near Real-Time Data.

• Massively Distributed

• High Availability

• Full-Text Search

• Document-Oriented

• Schema-Free

• Developer-Friendly, RESTful API

• Extensible via plugins

Concepts• Cluster

• Node

• Index

• Shard & Replica

• Type

• Mapping

• Document

How data is organized in Elasticsearch

Nodes & shards

Indexing documents

Sharding is crucial

• Shard is a physical Lucene index

• # documents in a Lucent index is 2 billion docs.

• When you create a index you have to declare the # shards, you can’t change later. Beware!

• Don’t try to over-sharding your index! Beware!

Distributed indexing

URL

http://localhost:9200/{index}/{type}/{document_id}

HTTPie for the samples

Creating an index$ http put :9200/my_index/ settings:='{ "index" : { "number_of_shards" : 3, "number_of_replicas" : 0 } }' -v PUT /my_index/ HTTP/1.1 Accept: application/json Accept-Encoding: gzip, deflate Connection: keep-alive Content-Length: 73 Content-Type: application/json Host: localhost:9200 User-Agent: HTTPie/0.9.3

{ "settings": { "index": { "number_of_replicas": 0, "number_of_shards": 3 } } }

HTTP/1.1 200 OK Content-Length: 21 Content-Type: application/json; charset=UTF-8

{ "acknowledged": true }

Creating a type$ http put :9200/my_index/_mapping/my_document properties:='{ "user_name": { "type": "string" } }' -v PUT /my_index/_mapping/my_document1 HTTP/1.1 Accept: application/json Content-Length: 49 Content-Type: application/json

{ "properties": { "user_name": { "type": "string" } } }

HTTP/1.1 200 OK Content-Length: 21 Content-Type: application/json; charset=UTF-8

{ "acknowledged": true }

Indexing$ http :9200/my_index/my_document user_name="Domingo Suarez" -v POST /my_index/my_document1 HTTP/1.1 Content-Length: 31 Content-Type: application/json

{ "user_name": "Domingo Suarez” }

HTTP/1.1 201 Created Content-Length: 149 Content-Type: application/json; charset=UTF-8 { "_id": "AVRaEeBK3Lbw2oDzSIWN", "_index": "my_index", "_shards": { "failed": 0, "successful": 1, "total": 1 }, "_type": "my_document1", "_version": 1, "created": true }

Search $ http :9200/my_index/my_document/_search?q=user_name:Domingo HTTP/1.1 200 OK Content-Length: 657 Content-Type: application/json; charset=UTF-8

{ "_shards": { "failed": 0, "successful": 3, "total": 3 }, "hits": { "hits": [ { "_id": "AVRaEdPJ3Lbw2oDzSIWM", "_index": "my_index", "_score": 0.625, "_source": { "user_name": "Domingo Suarez" }, "_type": "my_document1" } ], "max_score": 0.625, "total": 1 }, "timed_out": false, "took": 5 }

top related