The Dublin Core metadata element set
is a standard for cross-domain information resource description.
Element | Description | Value |
---|---|---|
Identifier | resource ID | http://old.zope.org/Members/TheJester/ZQuadSex |
Title | resource name | Zope Quick and Dirty Search Engine and Indexer |
Description | resource summary | ZQUADSEX: Zope QUick And Dirty Search Engine and indeXer. What's it for? It's for making a search engine based on content you provide, either by: o Submitting URLs to be indexed (which are not stored), o Uploading files to be indexed (which are not stored), o Using the <dtml-indexer></dtml-indexer> block tag to submit content for indexing. Using it: Install a 'Text Indexer' product somewhere in your tree. You can then submit URLs or files to be indexed into it (via DTML if you want). You can also use <dtml-indexer></dtml-indexer> to wrap content to be indexed. URL/tag-based indexing uses absolute_url() to determine an ID for the content, so if you use variables to alter the display of your page, you'll need to do some extra work to get it to work. *** Warning: the stats page has some code commented out; uncomment it for an interesting view of your index, but it will take a *long* time to render. You can specify the default search mode: o And Mode: all words in a query must match, o Or Mode: any word in a query can match. You can override the search mode dynamically if you want to. You can specify an amount of text to keep as a summary. You can specify a threshold (as a percentage of the total) for words to ignore in searching. Ignored words do not abort an 'and' search. You can specify a minimum period of time before reindexing occurs (useful for <dtml-indexer>). o Indexing does not occur if content has not changed since the last indexing. o URL indexing removes HTML tags before indexing, so 'font' and 'td' don't end up being the most frequent words in your index. Indexing is quite slow, but searching is quite quick: queries on 40Mb of data (that is, data submitted to be indexed) take on average less than 0.5 seconds. Can I do 'advanced' searches? Short answer: No. Long answer: Yes. Can it be made faster? Yes, it's not particularly optimised right now; there are quite a few things that can be done to make things run quicker, smoother, nicer, pick your superlative. Will you eventually do that? Probably not, unless someone wants to hire me to work on it; it's sufficient for my needs at the moment (although I do have a version that's tweaked for my dataset). |
Creator | resource creator | TheJester |
Date | default date | 2001-11-23 03:40:04 |
Format | resource format | text/html |
Type | resource type | Software Package |
Subject | resource keywords | Content Object, Helpers, Navigational, Server |
Contributors | resource collaborators | |
Language | resource language | |
Publisher | resource publisher | No publisher |
Rights | resource copyright | |
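
The search behaviour summarised in the Description row (And/Or modes, plus a frequency threshold for ignored words that does not abort an 'and' search) can be illustrated with a minimal Python sketch. This is not ZQuadSex's actual code: the function names `build_index` and `search`, the `threshold` parameter, and the reading of "percentage of the total" as the fraction of indexed documents containing a word are all assumptions made for illustration.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase word to the set of document IDs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search(index, total_docs, query, mode="and", threshold=0.5):
    """Return matching document IDs for a whitespace-separated query.

    Assumption: a word is ignored when it occurs in more than `threshold`
    (a fraction between 0 and 1) of all indexed documents.
    """
    postings = []
    for word in query.lower().split():
        hits = index.get(word, set())
        if total_docs and len(hits) / total_docs > threshold:
            continue  # too common: skip it instead of failing an 'and' query
        postings.append(hits)
    if not postings:
        return set()
    if mode == "and":
        return set.intersection(*postings)  # every remaining word must match
    return set.union(*postings)  # any remaining word may match
```

Skipping over-frequent words before combining the per-word result sets is what lets an 'and' query still succeed when it contains an ignored word.
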
Element | Description | Value |
---|---|---|
CreationDate | date resource created | 2001-11-23 03:40:04 |
ModificationDate | date resource last modified | 2001-11-23 03:40:04 |
EffectiveDate | date resource becomes effective | None |
ExpirationDate | date resource expires | None |