众所周知的原因,帮大家把这篇文章转载过来,写的很好。
It seems like Bloom filters are all the rage these days. Three years ago I had barely heard of them and now it seems like I see articles and code using them all the time. That's mostly a good thing, since bloom f[......]
众所周知的原因,帮大家把这篇文章转载过来,写的很好。
It seems like Bloom filters are all the rage these days. Three years ago I had barely heard of them and now it seems like I see articles and code using them all the time. That's mostly a good thing, since bloom f[......]
编译Sphinx 1.10之后,看看它到底是怎么用的。
总的来说,检索系统就是建索引和搜索两个过程。
由于我们不准备使用MySQL引擎的部分,数据源采用XML接口,因此和官方文档中出入较大。
1、配置Sphinx
cd /usr/local/sphinx/etc
sudo cp sphinx.conf.dist sphinx.conf
#编辑配置文件
sudo vim sphinx.conf
#xml数据源配置
source src1
{
type[......]
KC即Kyoto Cabinet就不多说了,是Tokyo Cabinet即TC的C++升级实现,而且版本更新较为频繁(是不是说明不太稳定呢?)。
如果是纯key-value的存储的化,KC是不二的选择。KT是KC的网络接口,下面记录一下编译过程。
环境:Ubuntu 10.04 LTS,Python 2.6.6
#KC
wget http://fallabs.com/kyotocabinet/pkg/kyotocabinet-1.2.57.tar.gz
tar -xzvf ky[......]
1、下载
wget http://sphinxsearch.com/files/sphinx-1.10-beta.tar.gz
tar -xzvf sphinx-1.10-beta.tar.gz
cd sphinx-1.10-beta/
2、相关库
出了libexpat用于xml之外,还有一个libstemer可选。其实还有libiconv和libzip,只不过我好像之前装过了。
sudo apt-get install libexpat-dev
3、编译
我不需要mysql支持[......]
http://blogfreakz.com/django/django-tutorials/[......]