welcome: please sign in

次の269語(ハイライト表示)は、1209語の辞書 1209語のLocalSpellingWordsを含む)中に見つかりませんでした。
about   addition   additional   advanced   allows   already   antiword   attachment   attachments   auto   available   back   basically   big   Binary   bindings   blacklist   bottom   build   buildnewindex   but   by   calls   can   case   catdoc   catppt   check   checking   command   Command   commonly   complete   completely   Config   config   configdir   configuration   configure   contain   containing   Contents   Creating   default   dependencies   Dependency   descriptions   dialogue   dir   directly   disable   disabling   disk   Document   Don   don   dramatically   Due   eat   enable   enabled   enables   Enabling   enabling   encodings   engine   engines   etc   Excel   except   execute   existing   extracted   fall   False   feature   features   ffffcc   files   filter   filters   find   For   for   forces   forget   forms   from   furthermore   gets   heading   history   how   if   If   images   improve   included   indexed   indexing   indices   information   Initially   initially   installed   installing   interfere   introduced   iso   its   itself   jpg   language   large   least   legacy   lets   like   line   linking   list   long   look   lot   means   might   mode   more   need   needs   new   next   non   None   note   Notes   od   older   On   on   one   only   options   or   pdf   pdftotext   perform   performance   performed   please   Please   plugins   poppler   possible   Powerpoint   ppt   prefixed   prefixes   process   projects   pywin32   rather   raw   Re   re   rebuild   rebuilds   recent   referred   Requirements   restart   results   revision   revisions   rowbgcolor   run   running   same   save   saved   searches   Searching   searching   section   see   See   Setting   several   should   size   slow   some   space   specify   Star   start   stemmed   stemming   stop   strings   stuff   supplied   support   switch   sx   System   Table   tablewidth   take   terms   test   tested   tester   Testing   testing   that   their   Then   this   This   those   time   timings   to   To   tool   tries   True   type   unavailable   underlay   unlock   up   update   usable   Usage   use   used   usenewindex   uses   Using   using   utf   utils   versions   want   way   well   were   what   where   whether   which   while   wikiconfig   will   Windows   wish   with   won   Word   words   work   Xapian   xapian   xls   xls2csv   xpdf   yield  

メッセージを消す
location: HelpOnXapian

Xapian

Using Xapian you can dramatically improve the performance of searching in moin and furthermore unlock some more features (see the search prefixes above) not possible with the legacy search engine.

Requirements

You must have Xapian itself and its Python bindings (xapian-core and xapian-bindings) from http://www.xapian.org/ at least in version 1.0.6 installed. In addition, Windows users need to install pywin32 from http://sourceforge.net/projects/pywin32/.

To process attachment files, moin uses filter plugins - here is the list of filter plugins included:

File type

Dependency

Notes

Text files (.txt)

-

tries utf-8 and iso-8859-15 encodings (or forces to ASCII if those do not work)

JPEG images (.jpg)

-

EXIF data is extracted

Open Office files (.sx?)

-

e.g. from older OpenOffice.org/StarOffice versions

Open Document files (.od?)

-

e.g. from recent OpenOffice.org/StarOffice versions

Binary files

-

moin uses a strings like filter to process those, as well as a blacklist with stuff you don't want to search

MS Word files (.doc)

antiword

filter calls antiword

MS Excel files (.xls)

catdoc

filter calls xls2csv

MS Powerpoint files (.ppt)

catdoc

filter calls catppt

PDF files (.pdf)

xpdf-utils or poppler-utils

filter calls pdftotext

After installing additional filters (or dependencies) you should (re)build your index. Xapian will find the new filters / support packages automagically. The next time your search results may contain results linking directly to your attachments.

Configuration

In your wikiconfig, you have several options on how to configure Xapian:

変数名 デフォルト 説明
xapian_index_dir None Directory where the Xapian search index is stored (None = auto-configure wiki local storage)
xapian_index_history False True to enable indexing of non-current page revisions.
xapian_search False True to enable the fast, indexed search (based on the Xapian search library)
xapian_stemming False True to enable Xapian word stemmer usage for indexing / searching.

(Re-)Building an index

You can use the supplied command line tool moin to initially build, completely rebuild or update an existing index.

To (re-)build your index please execute:

moin --config-dir=/where/your/configdir/is --wiki-url=wiki-url/ index build --mode=rebuild

For more information about the moin index command, see HelpOnMoinCommand.

/!\ Please note that you must rebuild your index if you change any of the xapian_index_history, xapian_index_dir or xapian_stemming configuration options!

If you have a large site, you may not wish for searching to be unavailable while your index rebuilds. In this case, you can start with moin index build --mode=buildnewindex. It is slow, but won't interfere with the running wiki. Then, stop the wiki, run moin index build --mode=usenewindex to switch to the new index, and restart the wiki.

Testing

You can test if Xapian is enabled and if an index is available by checking SystemInfo. To check if searches are performed using Xapian, enable show_timings in your wikiconfig, perform a search and look for _xapianSearch on the bottom of the page.

Usage

Xapian is basically used the same way as all other search engines. Due to Xapian's advanced features some new search term prefixed were introduced which are not already available in the legacy search engine (commonly referred to as moin search). See HelpOnSearching for more information and/or use the new advanced search dialogue available on FindPage to see what's available and possible.