
Musa Talukdar: Software Engineer 28 June, 2012

This document provides an overview of Apache Solr, an open-source search engine built on Apache Lucene. It discusses that Solr builds on Lucene by providing a server that exposes Lucene's indexing and search capabilities via REST-like HTTP APIs. It also summarizes Solr's key features such as indexing, querying, results, security, scaling, and alternatives to Solr.

Uploaded by

Musa Talukdar
Copyright
© Attribution Non-Commercial (BY-NC)

Musa Talukdar

Software Engineer, 28 June 2012

About Apache Solr

Created by Yonik Seeley for CNET
Contributed to Apache in January 2006
Version 3.5.0 released 27 November 2011
Leading open source search engine

Prominent Apache Solr users

CNET
Monster
Digg
Netflix
Yelp
Zappos
The White House

Lucene engine

Lucene is a high-performance search engine library
Lucene is not a server
It is an embeddable library
It does not provide a runtime service

Solr = server + Lucene

Solr is a Java web application
It is exposed over HTTP through a REST-like interface
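Because the interface is plain HTTP, a search is just a URL. The following is a minimal sketch of building such a URL in Python; the base address `http://localhost:8983/solr`, the field name `title`, and the helper `build_select_url` are assumptions made for illustration, and nothing is actually sent over the network.

```python
from urllib.parse import urlencode

# Assumed base URL of a local Solr instance; adjust to your own deployment.
SOLR_BASE = "http://localhost:8983/solr"


def build_select_url(query, rows=10, base=SOLR_BASE):
    """Return the URL of a search request against the /select handler."""
    params = {"q": query, "rows": rows}
    return base + "/select?" + urlencode(params)


# "title" is a hypothetical field name used only for this example.
url = build_select_url("title:lucene", rows=5)
print(url)
```

Opening the printed URL in a browser, or fetching it with any HTTP client, would run the query against a server at that address.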

Two basic configuration files

solrconfig.xml: Lucene index parameters, request handler mappings, cache settings, plugins
schema.xml: field types, dynamic fields

Three-step process

Declare the schema
Index the data
Query

Schema

Data types
Fields
Dynamic fields
Miscellaneous settings
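To make these four parts concrete, the sketch below embeds a heavily trimmed, hypothetical schema.xml in a Python string and reads it back with the standard library. The field names (`id`, `title`, `*_s`) are invented for the example; a real schema would contain many more entries.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down schema.xml; all names here are illustrative.
SCHEMA_XML = """
<schema name="example" version="1.4">
  <types>
    <fieldType name="string" class="solr.StrField"/>
    <fieldType name="text" class="solr.TextField"/>
  </types>
  <fields>
    <field name="id" type="string" indexed="true" stored="true"/>
    <field name="title" type="text" indexed="true" stored="true"/>
    <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
  </fields>
  <uniqueKey>id</uniqueKey>
  <defaultSearchField>title</defaultSearchField>
</schema>
"""

root = ET.fromstring(SCHEMA_XML)

# Data types, fields, dynamic fields, and one miscellaneous setting.
field_types = [e.get("name") for e in root.iter("fieldType")]
fields = {e.get("name"): e.get("type") for e in root.iter("field")}
dynamic_fields = [e.get("name") for e in root.iter("dynamicField")]
unique_key = root.findtext("uniqueKey")

print(field_types, fields, dynamic_fields, unique_key)
```

The dynamic field pattern `*_s` means any field whose name ends in `_s` is accepted without being declared individually.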

Analyzers, Tokenizers, and Token Filters


When a document is indexed, its individual fields are subject to the analyzing and tokenizing filters that can transform and normalize the data in the fields.
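The idea of an analysis chain can be illustrated with a toy example. This is not Solr's or Lucene's implementation, only a sketch of the concept: a tokenizer splits the field value, then each token filter transforms the token stream in turn. The stop-word list and function names are assumptions made for the example.

```python
import re

# Tiny illustrative stop-word list, not the one shipped with Solr.
STOP_WORDS = {"a", "an", "the", "of", "is"}


def tokenize(text):
    """Tokenizer: split the raw field value into word tokens."""
    return re.findall(r"\w+", text)


def lowercase_filter(tokens):
    """Token filter: normalize every token to lower case."""
    return [t.lower() for t in tokens]


def stop_filter(tokens):
    """Token filter: drop common words that carry little meaning."""
    return [t for t in tokens if t not in STOP_WORDS]


def analyze(text):
    """Analyzer: run the tokenizer, then each filter in order."""
    tokens = tokenize(text)
    for token_filter in (lowercase_filter, stop_filter):
        tokens = token_filter(tokens)
    return tokens


terms = analyze("The Architecture of Apache Solr")
print(terms)  # ['architecture', 'apache', 'solr']
```

The order of the filters matters here: lowercasing runs first so that "The" matches the lower-case stop word "the".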

Indexing data

Indexing is done by HTTP POST
Documents are sent as key-value pairs (field name and value)
Changes are finalized with commit or discarded with rollback
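As a rough sketch of what such a POST might look like, the code below builds an XML add message from key-value pairs and prepares, without sending, the corresponding HTTP requests. The update URL and the document fields are assumptions; a real setup may use a different path or core name.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Assumed update endpoint of a local Solr instance.
UPDATE_URL = "http://localhost:8983/solr/update"


def add_message(doc):
    """Serialize a dict of field name -> value as an <add> message."""
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        field = ET.SubElement(doc_el, "field", name=name)
        field.text = str(value)
    return ET.tostring(add)  # bytes, ready to use as a request body


def update_request(body, url=UPDATE_URL):
    """Prepare (but do not send) a POST request carrying an update message."""
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "text/xml"},
        method="POST",
    )


# Hypothetical document; the field names must exist in your schema.
add_req = update_request(add_message({"id": "doc1", "title": "Hello Solr"}))
commit_req = update_request(b"<commit/>")      # make pending changes visible
rollback_req = update_request(b"<rollback/>")  # discard uncommitted changes
```

Passing any of these requests to `urllib.request.urlopen` would send it to the server; added documents only become searchable after the commit.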

Index binary documents

XML
JSON
MS Office
OpenOffice
PDF
Images
MP3
ZIP

Query

Results

XML (default)
JSON
PHP
Python
Ruby
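The response format is chosen per request with the `wt` parameter (for example `wt=json`). The sketch below parses a hand-written sample shaped like a Solr JSON response; the documents and numbers in it are invented, and `summarize` is a hypothetical helper.

```python
import json

# Hand-written sample in the shape of a wt=json response; values are invented.
SAMPLE = """
{"responseHeader": {"status": 0, "QTime": 3},
 "response": {"numFound": 2, "start": 0, "docs": [
    {"id": "doc1", "title": "Hello Solr"},
    {"id": "doc2", "title": "Lucene in brief"}]}}
"""


def summarize(raw):
    """Return the total hit count and the ids of the returned documents."""
    data = json.loads(raw)
    response = data["response"]
    return response["numFound"], [doc["id"] for doc in response["docs"]]


num_found, ids = summarize(SAMPLE)
print(num_found, ids)  # 2 ['doc1', 'doc2']
```

Note that `numFound` is the total number of matches, which can be larger than the number of documents in `docs` when results are paged.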

Advanced features

Spell checking
Highlighting
More Like This
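These features are typically switched on through extra request parameters. The sketch below assembles such parameters; it assumes the relevant search components are already configured on the request handler in solrconfig.xml, and the helper `feature_params` and the field name `title` are hypothetical.

```python
from urllib.parse import urlencode


def feature_params(query, highlight_field=None, spellcheck=False,
                   more_like_this_field=None):
    """Build query parameters that enable optional search features."""
    params = {"q": query}
    if highlight_field:
        # Highlighting: mark matching snippets in the given field.
        params.update({"hl": "true", "hl.fl": highlight_field})
    if spellcheck:
        # Spell checking: ask for suggestions for misspelled terms.
        params["spellcheck"] = "true"
    if more_like_this_field:
        # More Like This: find documents similar to each hit.
        params.update({"mlt": "true", "mlt.fl": more_like_this_field})
    return params


query_string = urlencode(feature_params("solr", highlight_field="title"))
print(query_string)  # q=solr&hl=true&hl.fl=title
```

Whether each parameter has any effect depends on the server configuration, so treat this as a starting point rather than a guaranteed recipe.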

Security

Solr does not provide any security by itself
It relies on the security of the server it runs on
There is no standard document-level security

Scaling

Disadvantages of Apache Solr

No formal support contract
No assured availability of training or other professional services to fulfill specific software needs
No formalized release testing program, release schedule, or assurance of upgrade compatibility

Alternatives to Apache Solr

Elasticsearch is another search engine based on Lucene
SearchBlox is also based on Lucene
Xapian is another search engine, written in C++
Sphinx

Resources

http://www.lucidimagination.com
http://lucene.apache.org/solr
http://wiki.apache.org/solr
http://en.wikipedia.org/wiki/Apache_Solr
Solr presentation by Mike Brevoort
