All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Intro ] Sphider-plus is a search engine based on the original Sphider scripts created by Ando Saabas. In front of original Sphider additional mods, functions, template designs and debugging have been performed. For details about all changes, . . .
. . . Sphider-plus offers a wide range of customizing the index and search procedures. By means of an Admin backend, all settings are presented. As stated above, this search engine uses some PHP libraries and extensions. When opening the Setting interface, the existence off these libraries are tested by software, and in case that a library is not. . .
. . . of links might be interrupted, because the granted time slice might end before index procedure is finished. Especially if you intend to index not only text, but also media content like images, as well as audio and video streams. Sphider-plus tries 3 times to reconnect to the database. But if the server canceled the script, it will become. . .
. . . tries 3 times to reconnect to the database. But if the server canceled the script, it will become necessary to manually invoke again the index procedure to continue. Sphider-plus will remember the last indexed link and continue the suspended process. Some special functions like e.g. 'cyclical indexing' in any case will fail on a 'Shared Hosting'. . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Release ] Name: Sphider-plus Version: 4.2021b Released: May 12, 2021 Based on original Sphider version 1.3.5, released 2009-12-13 [ Legal Info ] This program is licensed under the GNU GPL v.3 by Rolf Kellner [Tec], tec(a t)sphider-plus.eu . . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Documentation Summary ] Preamble: The info presented here is valid only for the latest release of Sphider-plus. At present version 4.2021b published May 12, 2021 is the actual release. 1. Settings, customizing and statistics 2. Indexing 2.1 . . .
. . . May 12, 2021 is the actual release. 1. Settings, customizing and statistics 2. Indexing 2.1 Various options 2.2 Allow other hosts in same domain 2.3 Word stemming 2.4 Periodical Re-indexing 2.5 Preferred indexing 2.6 Multithreaded indexing 2.7 Create thumbnails during index procedure 2.8 Prevent indexing of known malware and pishing pages 2.9. . .
. . . Use private sitemap instead of global sitemap. 2.11 Create Sitemap file 3. Using the indexer from command line 3.1 All options 3.2 Multithreaded indexing 4. Keeping pages, words and files from being indexed 4.1 robots.txt 4.2 Must include / must not include string list 4.3 Ignoring links 4.4 Ignoring parts of a page by <! sphider_noindex >. . .
. . . Enhancing functionality of multiple database support 17. Search in categories 17.1 Hierachical structure 17.2 Parallel structure 18. User suggested sites 19. Vulnerability protection 19.1 Intrusion Detection System IDS 19.2 Prevent queries from Meta search engines and crawler known to be evil 19.3 Basic input validation against vulnerability. . .
. . . for Sphider-plus. Separated into different submenus like: Sites: - Add Site - Index only the new - Re-index all - Re-index only preferred URLs - Erase Re-index (available also for individual URLs) - Import/export URL list - Approve sites - Banned domains Categories: - Add, edit, delete - Create new subcategory under Index: - Basic indexing. . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Change Log Summary ] [ Actual release ] Version: 4.2021b Release date: May 12, 2021 New option: Implemented a pure PHP based converter in order to index .doc documents which were created by Microsoft Word 97. New option in ‘Statistics’ menu: Sho . . .
. . . not activated link options. Improved index procedure for option ' If available follow sitemap.xml '. Now skipping all dis-activated document links (in admin backend) like: PDF, XLSX, DOC, DOCX, ODT, etc. Reactivated captcha protection in option ' User may suggest a URL to be indexed '. Implemented with new algorithm to generate the captcha. Bug. . .
. . . suggest a URL to be indexed '. Implemented with new algorithm to generate the captcha. Bug fixed in option ' Block all queries sent by known spammer (IPs) '. Bug fixed in option ' Phrase’ search '. Some small bugs fixed. Involved folders and files that have been modified / added for this release: /addurl.php /admin/admin.php. . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Change Log Summary ] Version v.2.0 Release date: May 27, 2009 In front of Sphider-plus version 1.9 the following items have been added / modified: Multiple database support for up to 5 independent databases (expandable). Individual activation o . . .
. . . access permission for database configuration, independent from Admin login. Integrated availability check for all databases and their release relevant table structure. Individual for each database: - Backup and restore - Copy / Move from each database to each other database 32 MByte query cache for MySQL database. - To be activated in Admin. . .
. . . website, the crawler will be redirected to the canonical link and Sphider-plus will understand that the duplicates all refer to the canonical URL. For more details, please notice chapter Canonical <;link>; tag Index websites that are created with ASP.NET Definition for path to PDF converter integrated into Admin Settings interface.. . .
. . . are created with ASP.NET Definition for path to PDF converter integrated into Admin Settings interface. Additionally the default setting - as required for the Operating System environment - is suggested. If path to PDF converter is invalid and converter is not accessible, an error message (in Admin Settings dialog) is created. Additional Admin. . .
. . . not accessible, an error message (in Admin Settings dialog) is created. Additional Admin setting to enable optionally indexing of external hosted media content. Improved index procedure of media files, by avoiding indexing of duplicate media content. Improved image indexing by reducing the required download time. Improved index / re-index. . .
All required information. Introduction Release and Legal Info Install ation Documentation Change Log [ Change Log Summary ] Version v.2.1 Release date: September 03, 2009 In front of Sphider-plus version 2.0 the following items have been added / modified: New item in Admin settings: Perform a segmentation of Chinese and Korean text during index / . . .
. . . during index / re-index procedure. Will divide phrases like 帽子和服装 into the base words 帽子 and 和 and 服装 , so that all will become searchable. Valid for Chinese sites with charset: GB2312, GBK and GB18030 Valid for Korean sites with charset: EUC-KR and ISO10646-1933 New item in Admin setting: Index password protected sites. If enabled,. . .
. . . any the words in whitelist - Use whitelist in order to enable index / re-index only those pages that include all the words in whitelist Improved 'Follow sitemap.xml' procedure: If <;sitemapindex . . >; is detected in a sitemap.xml file, and if multiple Sitemap files are available, Sphider-plus will process the secondary Sitemaps and. . .
. . . file, and if multiple Sitemap files are available, Sphider-plus will process the secondary Sitemaps and extract all links for index / re-index. Also gzip-compressed files (Index Sitemap files as well as the Sitemap files) will be processed. Improved index / re-index procedure: If charset of a site to be indexed is undetectable, because it is. . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Change Log Summary ] Version v.2.2 Release date: December 22, 2009 Build up with Sphider: v.1.3.5 In front of Sphider-plus version 2.1 the following items have been added / modified: Improved multiple database support: Results may now be c . . .
. . . - 5 databases could be configured to fetch results for the common result listing. Valid for text and media search, all search modes, taking into account category selection. More details in documentation chapter Activate / Disable databases Improved RSS and Atom feed index procedure. Including now also a validation for the well-formed XML. Support. . .
. . . Follow CDATA directives for feed content. Additional item in Admin settings: Index 'Dublin Core' and other individually marked tags in RDF feeds. Additional item in Admin settings: Follow the 'preferred (true/false)' directive in RSD feeds. Detection of encoding (charset) added for XML and XHTML files. New item in Admin settings: During index. . .
. . . of encoding (charset) added for XML and XHTML files. New item in Admin settings: During index procedure, convert all kind of single quotes like ` ´ ’ ‘ into standard quotes ' New item in Admin settings: Reduce queries which contain quotes to the basic word. This will deliver the same results for queries like: d'information = information or. . .
. . . or dei'largi = largi Results will be highlighted for the base word. Exclusive noun, pronoun, etc. Works for all kinds of single quotes. New Admin setting: For queries containing numbers, search with wildcards. Useful to search for complex article numbers, if the user only knows a part of the complete item description New Admin setting:. . .
All required information. Introduction Release and Legal Info Installation Documentation Change Log [ Change Log Summary ] Version v.2.4 Release date: July 03, 2010 Build up with Sphider: v.1.3.5 New feature: In order to reduce the time for indexing, multithreaded indexing was implemented. As part of the Admin settings, 1-10 threads are to be a . . .
. . . line options: - Erase content of database <;-erase>; - Set ‘Last indexed’ date and time to 0000 <;-preall>; Improved support for Japanese coded sites (charset: SHIFT_JIS, EUC-JP and UTF-8). Template design 'Pure' reduced to 'like Google'. Improved search algorithm: significantly reduced search time. Improved support for Greek. . .
. . . Transliterate queries with Latin characters into their Greek equivalents. Will for example transform query input alla to find ἀλλὰ and baptismatos to find βαπτίσματος. 2. Accept Greek queries containing vowels without accents. Query input of letter α will be valid also for ἀ, ἁ, ἂ, ἃ, ἄ, ἅ, ἆ, ἇ, ὰ, ά, ά, ᾀ, ᾁ, ᾂ, ᾃ, ᾄ, ᾅ, ᾆ, ᾇ, ᾰ, ᾱ, ᾲ, ᾳ, ᾴ,. . .
. . . also for ἀ, ἁ, ἂ, ἃ, ἄ, ἅ, ἆ, ἇ, ὰ, ά, ά, ᾀ, ᾁ, ᾂ, ᾃ, ᾄ, ᾅ, ᾆ, ᾇ, ᾰ, ᾱ, ᾲ, ᾳ, ᾴ, ᾶ and ᾷ The same behavior for all other Greek vowels, as well as for the upper case vowels. Both options will create a tolerant result listing. New options in 'Add site' and 'Edit site' menus: - Enter URL of individual Sitemap If Sitemap is not in root folder,. . .
. . . number of results shown per domain is selectable. Offers result presentation similar to 'Like Google', but additionally offers a selectable count of links. Search option 'More results from this domain' not only enabled for result sorting 'Like Google', but also for 'By URL names'. Bug fixed that prevented correct interpretation of http 301. . .