4.8 Index only HTML elements defined by <tagname> . . </tagname > 4.9 Ignored words 4.10 Use of Whitelistelist 4.11 Use of Blacklist 4.12 Ignored files 4.13 Canonical <link> tag 5. UTF-8 Support and 'Preferred charset' 6. Search modes 6.1 Search with wildcards * 6.2 Strict search ! 6.3 Tolerant search 6.4 Link search 6.5 Media . . .
. . .
/templates/My_template/userstyle.css Statistics output: - Top keywords (Top 50 with hitelist counter). - All indexed thumbnails w 3e80 ith ID3 and EXIF info. - Larges pages offering link URL and file size. - Most Popular Searches for text links offering: Link addr., total clicks, last clicked, last query (Top 50) - Most . . .
. . .
the common word list is case sensitive for the following languages: - Arabic - Chinese - Cyrillic 4.10 Use of Whitelistelist Sphider-plus offers the capability to control the index / re-index procedure by a list of words called 'whitelistelist'. Only if the text of the page contains words of the whitelistelist, the according page will be indexed / . . .
. . .
of the whitelistelist, the according page will be indexed / re-indexed. The list is placed in the file /include/common/whitelistelist.txt Text-content is defined by Admin settings by means of what to index: full text, title, keywords etc. Content of links(URLs) is controlled separately by "Must include / must not include string list" The use of the . . .
. . .
whitelistelist may be activated / deactivated by two different checkboxes in Admin / Settings/ Spider settings: - Use whitelistelist in order to index / re-index only those pages that include ANY of the words in whitelistelist - Use whitelistelist in order to index / re-index only those pages that include ALL the words in whitelistelist Take notice, that these functions . . .
. . .
Take notice, that these functions are not case sensitive. So, you only need to include one spelling into the whitelistelist.txt file. Content of whitelistelist is treated as 'words'. So the word 'kinder' in your whitelistelist will not accept pages that contain the word 'kindergarten'. Be aware not to place blank rows into the whitelistelist. Also the list . . .
. . .
Please keep in mind that 'Use of Blacklist is implemented in a different way than implementation of 'Use of whitelistelist'. Blacklist is interpreting its content as a string. So, the word 'kinder' in blacklist, will also prevent indexing of a page containing the word 'kindergarten'. Be aware not to place blank rows into the blacklist. Also the . . .
. . .
must be placed in 'ext.txt'. To be seen as a blacklist for file suffixes. While image.txt audio.txt video.txt are whitelistelists that include suffixed for files to be indexed, according to the type of media. 4.13 Canonical <link> tag As defined by Google, Microsoft and Yahoo! in February 2009, also Sphider-plus will follow the instruction of a . . .
. . .
the amount in the result listing, there is an option in the admin backend named Define maximum count of result hitelists for queries with wildcards To be found in ‘Settings’ = ‘Search Settings’. If you like to know the multiple words found in the database to be highlighted: In your editor open the script: /include/searchfuncs.php Find the row . . .