Interview Questions and Answers
Freshers / Beginner level questions & answers
Ques 1. What is Elasticsearch?
Elasticsearch is a distributed search and analytics engine built on top of Apache Lucene.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 2. Explain the term 'Index' in Elasticsearch.
An index in Elasticsearch is a collection of documents that share a similar structure and are stored together for efficient searching.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 3. What is a Document in Elasticsearch?
A document is the basic unit of information in Elasticsearch, represented in JSON format.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 4. Explain the purpose of the 'Cluster' in Elasticsearch.
A cluster is a collection of nodes that work together and share the same cluster name, forming a single logical unit.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 5. What is the role of a 'Node' in Elasticsearch?
A node is a single instance of Elasticsearch running within a cluster, storing data and participating in cluster operations.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 6. Explain the concept of 'Tokenization' in Elasticsearch.
Tokenization is the process of breaking a text into individual terms or tokens, which become searchable in Elasticsearch.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 7. What is the purpose of the 'Refresh' operation in Elasticsearch?
Refresh makes changes to the index immediately visible for search, but it comes with a performance cost.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 8. Explain the role of 'Filter' in Elasticsearch queries.
Filters are used to narrow down the search results based on specific criteria without affecting scoring.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 9. Explain the term 'Bulk API' in Elasticsearch.
The Bulk API allows for the efficient indexing and deletion of multiple documents in a single request.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 10. What is the purpose of the 'CAT API' in Elasticsearch?
The CAT API provides concise information about the cluster, indices, and nodes in a human-readable format.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 11. What is Elasticsearch?
Elasticsearch is a distributed search and analytics engine built on top of Apache Lucene. It provides a scalable and real-time search solution.
Example:
GET /index/_search?q=query
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 12. What is a 'Node' in Elasticsearch?
A node is a single instance of Elasticsearch running on a machine. It is part of a cluster and stores data and participates in the indexing and search capabilities of Elasticsearch.
Example:
GET /_cat/nodes?v
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 13. What is the role of the 'Cat' API in Elasticsearch?
The 'Cat' API in Elasticsearch provides concise and human-readable information about the cluster, nodes, indices, and other components. It is useful for monitoring and debugging.
Example:
GET /_cat/indices?v
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 14. What is the purpose of the 'Cluster Health' API in Elasticsearch?
The 'Cluster Health' API provides information about the overall health of the Elasticsearch cluster, including the status of nodes, indices, and other vital statistics.
Example:
GET /_cluster/health
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 15. How can you limit the number of results in an Elasticsearch query?
You can use the 'size' parameter in your query to limit the number of results returned. For example, 'size': 10 will return only 10 documents.
Example:
GET /my_index/_search
{
"query": {
"match_all": {}
},
"size": 10
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Intermediate / 1 to 5 years experienced level questions & answers
Ques 16. Differentiate between a Shard and a Replica in Elasticsearch.
A shard is a basic unit that stores data, while a replica is a copy of a shard for fault tolerance and scalability.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 17. Explain the purpose of an Analyzer in Elasticsearch.
An analyzer is used to preprocess data during indexing and searching, including tokenization and stemming.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 18. What is Inverted Index in Elasticsearch?
An inverted index is a data structure used to efficiently map terms to the documents containing them.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 19. Describe the significance of the 'Mapping' in Elasticsearch.
Mapping defines how documents and their fields are stored and indexed, specifying data types and configurations.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 20. How does Elasticsearch handle distributed search and indexing?
Elasticsearch distributes data across nodes, allowing for parallel processing and improved performance.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 21. What is the purpose of the 'Query DSL' in Elasticsearch?
The Query DSL (Domain Specific Language) allows users to define queries and filters using a JSON-like syntax.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 22. How does Elasticsearch handle schema-less data?
Elasticsearch allows dynamic mapping, automatically inferring field data types based on the inserted documents.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 23. What is the purpose of the 'Aggregation' framework in Elasticsearch?
Aggregations provide the capability to perform complex analysis and computation on the data.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 24. How does Elasticsearch handle relevance scoring in search results?
Elasticsearch uses a scoring algorithm based on the relevance of documents to the search query.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 25. Explain the term 'Mapping Conflict' in Elasticsearch.
Mapping conflict occurs when conflicting field types are encountered during dynamic mapping.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 26. What is the purpose of the 'Snapshot' and 'Restore' feature in Elasticsearch?
Snapshot and Restore allow for the backup and recovery of an entire cluster or specific indices.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 27. Describe the 'Search Shards' concept in Elasticsearch.
Search Shards are individual units of a search request distributed across nodes for parallel processing.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 28. Explain the use of the 'Alias' feature in Elasticsearch.
Aliases are used to provide a permanent and abstract name to an index, simplifying index management and searches.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 29. What is the purpose of the 'Fielddata' cache in Elasticsearch?
Fielddata cache stores the data structures necessary for sorting and aggregating on fields, improving performance.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 30. Explain the role of the 'Cluster State' in Elasticsearch.
The Cluster State holds information about the entire cluster, including metadata about indices, nodes, and shards.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 31. What is the purpose of the 'Recovery' process in Elasticsearch?
Recovery is the process of restoring a shard to a consistent state after a node failure or restart.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 32. Explain the term 'Fuzzy Query' in Elasticsearch.
A Fuzzy Query is used to find documents that match a specified term with a certain degree of error or similarity.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 33. What is the purpose of the 'Token Filter' in Elasticsearch?
Token Filters modify the tokens generated during the tokenization process, influencing the search and indexing process.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 34. Explain the term 'Caching' in Elasticsearch.
Caching involves storing frequently used data to reduce the need for repeated computations, improving performance.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 35. What is the purpose of the 'Routing' in Elasticsearch?
Routing determines which shard a document should be stored in based on a predefined value, optimizing search performance.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 36. Explain the concept of an index in Elasticsearch.
An index in Elasticsearch is a collection of documents that share similar characteristics. It is similar to a database in relational databases.
Example:
PUT /my_index
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 37. What is a shard in Elasticsearch?
A shard is a basic unit of storage and search in Elasticsearch. Indexes are divided into shards to distribute data across multiple nodes for scalability.
Example:
PUT /my_index/_settings
{
"number_of_shards": 5
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 38. Explain the purpose of the term 'mapping' in Elasticsearch.
Mapping in Elasticsearch is the process of defining how a document and its fields are stored and indexed. It helps in defining the data type, analysis, and other properties.
Example:
PUT /my_index
{
"mappings": {
"properties": {
"title": { "type": "text" }
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 39. Explain the purpose of the 'Analyzer' in Elasticsearch.
An analyzer in Elasticsearch is responsible for processing the text during indexing and searching. It includes a tokenizer and one or more token filters.
Example:
PUT /my_index
{
"settings": {
"analysis": {
"analyzer": {
"custom_analyzer": {
"type": "custom",
"tokenizer": "standard",
"filter": ["lowercase", "my_custom_filter"]
}
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 40. What is the purpose of the 'Query DSL' in Elasticsearch?
The Query DSL (Domain Specific Language) in Elasticsearch is used to define queries in a JSON format. It allows for complex and flexible querying of data.
Example:
{
"query": {
"match": {
"field": "value"
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 41. Explain the 'Bulk' API in Elasticsearch.
The Bulk API in Elasticsearch allows you to index, delete, or update multiple documents in a single request for better performance. It reduces the overhead of handling individual requests.
Example:
POST /my_index/_bulk
{ "index": { "_id": "1" } }
{ "field": "value1" }
{ "delete": { "_id": "2" } }
{ "create": { "_id": "3" } }
{ "field": "value3" }
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 42. How does the 'Geo-Point' type work in Elasticsearch?
The 'Geo-Point' type in Elasticsearch is used to index and search for geographical coordinates, such as latitude and longitude. It enables spatial queries for location-based data.
Example:
PUT /my_index
{
"mappings": {
"properties": {
"location": {
"type": "geo_point"
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 43. Explain the concept of 'Refresh' in Elasticsearch.
The 'Refresh' operation in Elasticsearch makes recent changes to the index immediately visible for search. It is an important aspect for near real-time search.
Example:
POST /my_index/_refresh
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 44. Explain the concept of 'Routing' in Elasticsearch.
Routing in Elasticsearch is the process of determining which shard a document should be stored in. It is based on the document's routing value and helps distribute data evenly.
Example:
PUT /my_index/_doc/1?routing=user123
{
"field": "value"
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 45. Explain the use of the 'Nested' datatype in Elasticsearch.
The 'Nested' datatype in Elasticsearch is used when dealing with arrays of objects. It allows you to query and index objects as separate entities, maintaining the relationships.
Example:
PUT /my_index
{
"mappings": {
"properties": {
"comments": {
"type": "nested"
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 46. How does the 'Fuzzy' query work in Elasticsearch?
The 'Fuzzy' query in Elasticsearch is used to find approximate matches for a given query term. It is useful for handling typos or variations in spelling.
Example:
GET /my_index/_search
{
"query": {
"fuzzy": {
"field": "value",
"fuzziness": 2
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 47. What is the 'Wildcards' query in Elasticsearch used for?
The 'Wildcards' query allows you to perform wildcard-based searches on string fields. It supports '*' for any number of characters and '?' for a single character.
Example:
GET /my_index/_search
{
"query": {
"wildcard": {
"field": "va*lue"
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 48. Explain the concept of 'Field Data' in Elasticsearch.
Field Data in Elasticsearch is used to cache field values in memory for better performance. It is essential for aggregations and sorting operations.
Example:
GET /my_index/_search
{
"aggs": {
"sum_prices": {
"sum": {
"field": "price",
"format": "doc_values"
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Experienced / Expert level questions & answers
Ques 49. How does Elasticsearch handle security?
Elasticsearch provides security features like role-based access control, SSL/TLS encryption, and authentication mechanisms.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 50. How can you improve the performance of Elasticsearch queries?
Performance can be improved by optimizing mappings, using proper analyzers, and scaling the cluster horizontally.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 51. How does Elasticsearch handle conflicts during distributed writes?
Elasticsearch uses versioning to handle conflicts during distributed writes, ensuring data consistency.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 52. How can you handle high availability in Elasticsearch?
High availability can be achieved by configuring multiple nodes, using replicas, and implementing proper failover mechanisms.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 53. Describe the 'Scripting' feature in Elasticsearch.
Scripting allows users to write custom scripts for advanced calculations, filtering, and scoring in Elasticsearch queries.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 54. How does Elasticsearch handle conflicts during index updates?
Elasticsearch uses a versioning mechanism to handle conflicts during index updates and ensure consistency.
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 55. What is the role of a filter in Elasticsearch?
Filters in Elasticsearch are used to narrow down the search results based on specific criteria. They are applied to queries to exclude or include documents in the search.
Example:
GET /my_index/_search
{
"query": {
"bool": {
"filter": {
"range": {
"price": { "gte": 20 }
}
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 56. How does Elasticsearch achieve high availability?
Elasticsearch achieves high availability through the concept of replication. Each shard has one or more replicas, and if a node fails, its shards can be served by replicas on other nodes.
Example:
PUT /my_index/_settings
{
"number_of_replicas": 2
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 57. What is the purpose of the 'Aggregations' framework in Elasticsearch?
Aggregations in Elasticsearch provide the ability to perform data analysis on the results of a query. They enable you to extract and process aggregated information from the data.
Example:
GET /my_index/_search
{
"aggs": {
"average_price": {
"avg": {
"field": "price"
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 58. How can you secure communication in Elasticsearch?
Communication in Elasticsearch can be secured using HTTPS. You can enable SSL/TLS to encrypt the data transmitted between nodes and clients for secure communication.
Example:
PUT /_cluster/settings
{
"persistent": {
"xpack.security.http.ssl.enabled": true
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 59. Explain the purpose of the 'Reindex' API in Elasticsearch.
The 'Reindex' API is used to copy documents from one index to another. It is useful for reorganizing data, changing mappings, or upgrading Elasticsearch versions.
Example:
POST _reindex
{
"source": {
"index": "old_index"
},
"dest": {
"index": "new_index"
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 60. Explain the purpose of the 'Percolator' in Elasticsearch.
The 'Percolator' in Elasticsearch is used for reverse searching. Instead of searching for documents, it allows you to register queries and match them against incoming documents.
Example:
PUT /my_index/_doc/my_percolator_query
{
"query": {
"match": {
"field": "value"
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Ques 61. What is the purpose of the 'Script' query in Elasticsearch?
The 'Script' query allows you to execute custom scripts during the search process. It is useful for complex calculations or custom scoring logic.
Example:
GET /my_index/_search
{
"query": {
"script": {
"script": {
"source": "doc['field'].value > 10"
}
}
}
}
Save For Revision
Save For Revision
Bookmark this item, mark it difficult, or place it in a revision set.
Log in to save bookmarks, difficult questions, and revision sets.
Most helpful rated by users: