12, 2024 is the actual release. 1. Settings, customizing and statistics 2. Indexing 2.1 Various options 2.2 Allow other hosts in same domain 2.3 Word stemming 2.4 Periodical Re-indexing 2.5 Preferred indexing 2.6 Multithreaded indexing 2.7 Create thumbnails during index procedure 2.8 Prevent indexing of known malware and pishing pages 2.9 Follow . . .
. . .
Follow and create sitemap files. See below for details . Word stemming. See below for details . 2.2 Allow other hosts in same domain This Admin selectable option allows to index other hosts with the same domain name and it also ignores TLD, SLD and www. If e.g. calling from http://www.sphider-plus.eu links like: - http://sphider-plus.eu . . .
. . .
The first one is following all links found during index procedure. The second one is only following the links to other hosts, if the found links are redirected. 2.3 Word stemming Sphider-plus is offering language specific stemming algorithms for 15 languages: Bulgarian, Chinese, Czech, Dutch, English, Finnish, French, German, Greek, Hungarian, . . .
. . .
remain activated, even if word stemming later on is reset to 'none' and need to be deselected manually. On the other hand, if activated for indexing, the stemming selection must remain activated, because also the query input must be stemmed. As during the index procedure only the etymons are stored in database, this will create results . . .
. . .
the Re-index procedures will silently work in the background without creating monitor output, but like in all other indexing modes, writing the index results intolog files. Additionally a log file showing the status of the periodical indexer is created, presenting all dates and times when the Re-index procedures were started, as well as the . . .
. . .
the 'Back to admin' button is presented. Never the less, a former started thread still might be busy to index another site. To be seen in Admin 'Sites' view by the 'Unfinished' message at the corresponding site. Refreshing the 'Sites' window offers the successful end for all threads; by replacing the 'Unfinished' message with the date of last . . .
. . .
<-preall> This will reset all 'Last indexed' tables to '0000', but will not erase the content of all the other tables. So the check whether the content of a page has changed (MD5sum) is still available for a fast re-index procedure. Once prepared, multithreaded re-indexing could be invoked by starting several threads and adding . . .
. . .
may alternately contain a regexp pattern. The regexp needs to be introduced by */ and must be ended with another slash. Example: */ menu[0-5] / 4.6 Indexing only parts of a page by <div id='abc'> If enabled in Admin settings, the values as defined in the list-file /include/common/divs_use.txt will be used to index only the content . . .
. . .
may alternately contain a regexp pattern. The regexp needs to be introduced by */ and must be ended with another slash. Example: */ table[0-5] / 4.7 Ignore HTML elements defined by <tagname> . . </tagname> This option is foreseen to cooperate with the new HTML5 elements like section, nav, aside, hgroup, article, header, . . .