Discussions

News: Solr 1.1, Lucene based search server, released

  1. Solr 1.1, Lucene based search server, released (12 messages)

    Apache Solr 1.1 is the first Solr release since joining the Apache Incubator. Solr is a high performance full-text search server based on Lucene, written in Java5, and easily extensible through plugins written in Java. Documents are added to a search collection via XML over HTTP, and the search collection is queried via HTTP to receive an XML response (or alternately JSON, Python or Ruby text formats). Major featurs include:
    • A powerful data schema
    • External configuration via xml
    • Faceted search
    • Hit highlighting
    • Flexible caching
    • Replication
    • A web admin interface
    For more information see the detailed feature list. The release is available for download here. Apache Solr is currently under incubation and is slated to become a sub-project of the Apache Lucene project.

    Threaded Messages (12)

  2. I recently used Solr on a project and it has worked wonderfully. The added features that were implemented on top of Lucene really helped us out.
  3. Xml Documents Only?[ Go to top ]

    Does Solr handle XML Documents only? If so it is a very useful (but incomplete) next step on top of Apache Lucene. Paul http://red-piranha.sourceforge.net
  4. Re: Xml Documents Only?[ Go to top ]

    Does Solr handle XML Documents only?
    No, XML is only used as the transport for indexing Lucene documents (Solr doesn't have "crawlers"). This assumes you already know how to split up your data into fields and values. Example: Lucene In Action Erik Hatcher Otis Gospodnetic 1932394281 [...]
  5. Solr and clustering[ Go to top ]

    Can someone highlight the strengths of Solr in terms of clustering and replication? The Solr homepage mentions that they can do this and that - but don't describe HOW they achieve it. Project based usages could help to decide whether this a solution worth considering.
  6. Re: Solr and clustering[ Go to top ]

    Can someone highlight the strengths of Solr in terms of clustering and replication?

    The Solr homepage mentions that they can do this and that - but don't describe HOW they achieve it.
    Project based usages could help to decide whether this a solution worth considering.
    Did you read the Wiki? http://wiki.apache.org/solr/ This items focus on what you want: # CollectionDistribution * SolrCollectionDistributionScripts * SolrCollectionDistributionStatusStats * SolrCollectionDistributionOperationsOutline # CollectionRebuilding I hope it helps. Luis Neves
  7. Re: Solr and clustering[ Go to top ]

    Master/Worker pattern, similar to Terracotta DSO/Clustering. basically a centralize server that handles all the updates that will happen.. and you have "searcher" nodes which maintains a local copy of the index,the master server updates the worker nodes indexes in interval...
  8. Could you provide comparison with Nutch please?
  9. Could you provide comparison with Nutch please?
    Nutch is more like an open-source google... it's for crawling, converting, indexing, and searching websites. Solr is more of a general-purpose search server, and it assumes you already have structured data (like catalog data, music collections, etc).
  10. Structured Data?[ Go to top ]

    If the data is structured, why not use a real database? Steve
  11. Re: Structured Data?[ Go to top ]

    Current free databases don't do full-text search well, and it's painful to try and do things like faceted search.
  12. Hi Everyone, Can you please let me know what configurations should be done for highlighting search key words in results obtained from Lucene. I read somewhere that i have to give hl=true while querying for a document in Lucene & the attributes to be highlighted will be enclosed in tag, when we receive the response document from Lucene. But,i am not getting any attributes enclosed with tag. Please let me know the steps for highlighting the search keys in the result, that i have got from Lucene. Thanks, Anil
  13. Highlight[ Go to top ]

    You need to explicitly specify which fields you want to highlight: hl.fl=text1,text2