Lucene
0.定義Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java.
It is a technology suitable for nearly any application that requires full-text search, especially cross-platform
Apache Lucene是一個高性能,全文本特性的JAVA搜索引擎.它的技術(shù)適合于任何全文搜索應(yīng)用,特別是跨平臺.
1.simpleExample 提供最簡單的sample
2.特性
Lucene is a high performance, scalable, cross-platform search engine that contains many advanced features that
often go untapped by the majority of users. In this session, designed for those familiar with Lucene,
we will examine some of Lucene's more advanced topics and their application, including:
Term Vectors:
Manual and Pseudo relevance feedback;
Advanced document collection analysis for domain specialization
Span Queries:
Better phrase matching; Candidate Identification for Question Answering
Tying it all Together:
Building a search framework for experimentation and rapid deployment
Case Studies from CNLP:
Crosslingual/multilingual retrieval in Arabic, English and Dutch;
Sublanguage specialization for commercial trouble ticket analysis;
Passage retrieval and analysis for Question Answering application
Lucene是一個高性能,可收縮,跨平臺搜索引擎,包括從未被大多數(shù)用戶使用很多高級特性.對此,我們將檢查Lucene一些更多高級主題和應(yīng)用,包括:
Term矢量:
人工和虛擬適當(dāng)?shù)姆答?
高級特殊化域文檔收集分析;
Span (=Statistical Processing and Analysis 統(tǒng)計處理及分析)查詢
更好短語匹配;
緊密結(jié)合
創(chuàng)建一個試驗搜索引擎框架和快速開發(fā)
來自CNLP學(xué)習(xí)用例
使用阿拉伯的, 英語 和荷蘭語交流/多語言
...
3.關(guān)鍵字/關(guān)鍵用語
4.技術(shù)
5.原理
6.同類對比
7.深入研究
8.Referrence
<Lucene in Action>
posted on 2005-12-21 18:37 jame 閱讀(277) 評論(0) 編輯 收藏 所屬分類: Structs,spring,Hibernate等開源技術(shù)研究