The Apache Jakarta team has announced the final release of Lucene 1.3. Jakarta Lucene is a high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
- Posted by: Dion Almaer
- Posted on: December 26 2003 16:25 EST
Check out the Lucene Change Log for recent changes.
Download Lucene 1.3 Final.
Go to the Jakarta Lucene Home Page
TheServerSide uses Lucene for our new search tool, and thanks the Lucene team for their hard work.
- 1A Software :-)! by Henrik Klagges on December 28 2003 16:57 EST
- Other search engines.. by Paul Danckaert on December 29 2003 10:58 EST
- Series of essays on search by Patrick Carroll on December 29 2003 11:20 EST
- some flaw by Huang Kai on December 29 2003 20:26 EST
Here is a BIG thanks to the Lucene Developer Community. We use Lucene as a search engine in an I18N B2B eCommerce website with good success. One of our partner companies is running an absolutely huge search website on Lucene (they are top 3 in Germany in their area of business). They used to run a commercial search engine on 20 Solaris multiprocessor servers. With Lucene, they only need 5 of the same machines _and_ get better response times as well. Pretty good!
Lucene looks like a nice engine.. I haven't used the most recent versions, but the prior versions were quite nice.
As a point of reference, another engine I have used in the past was Ultraseek, by Infoseek...er.. Inktomi.. er.. Verity! It was interesting as it was compiled Python and gave you the option to override functions with your own method.. so you could change, say, the ranking or dating capabilities really flexibly. (That worked out great for one client, who had a non-standard HTML META tag in the document for the publishing date..) They now have a Java API into the engine, and a Content Classification Engine (CCE) for auto-building topic trees.. On the bad side (if you consider it so) it's fairly expensive. Unless you need all of their bells, whistles, and occasional flute.. Lucene should handle all of your normal searching needs..
Lucene is mentioned, so there is at least a tangential connection.
Lucene is a great engine. But it seems to support wide-character languages somewhat badly, maybe due to the complexity of word division. We Chinese folks have to recode some parts of it, though it's not very difficult.
Please check the jakarta-lucene-sandbox module in CVS - it contains much improved support for processing CJK. This support didn't make it into the release because it was too late.
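To make the word-division problem concrete: Chinese text has no spaces between words, so a whitespace-based tokenizer sees one long token. One common workaround is to index overlapping two-character pairs (bigrams) instead of trying to find real word boundaries. The sketch below is only an illustration of that idea in plain Java; the class and method names (`CjkBigrams`, `bigrams`) are made up for this example, it does not use any Lucene API, and it is not necessarily what the sandbox code does. It also assumes the input is a single run of CJK characters from the Basic Multilingual Plane.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Lucene.
public class CjkBigrams {

    // Splits a run of CJK text into overlapping two-character tokens.
    // Assumes every character fits in one Java char (no surrogate pairs).
    static List<String> bigrams(String text) {
        List<String> tokens = new ArrayList<>();
        if (text.length() == 1) {
            // A lone character becomes its own token.
            tokens.add(text);
            return tokens;
        }
        for (int i = 0; i + 1 < text.length(); i++) {
            tokens.add(text.substring(i, i + 2));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints [中文, 文搜, 搜索, 索引, 引擎]
        System.out.println(bigrams("中文搜索引擎"));
    }
}
```

If the query is split the same way, a search for 搜索 matches the bigram 搜索 in the index without any dictionary. The cost is a larger index and some false matches across word boundaries.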
Perhaps there are more free search engines out there that are also really cool, like:
mnoGoSearch (formerly known as UdmSearch) is a full-featured web search engine software for intranet and internet servers. mnoGoSearch for UNIX is a free software covered by the GNU General Public License and mnoGoSearch for Windows is a commercial search software version.
SWISH-E is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other files.
The ht://Dig system is a complete indexing and searching system for a domain or intranet.