Sphider-plus



Displaying results 1 - 10 of 15 matches

1.   Sphider-plus - The PHP Search Engine

of the word exceeding the defined length. Not inside a word at the character count limit defined in Admin setting. PDF converter for LINUX/UNIX Operating Systems included. Needs to be individualized according to readme.PDF documentation, chapter 'PDF converter for Linux server.' Thanks to rasc. Additional item in Admin section: Server Info To be . . .
. . .
To be found in submenu 'Statistics', important information is presented for: - Server - Environment - MySQL - PDF converter - php.ini file - PHP integration Enlarged Admin interface if database is empty. Improved printout for database connection problems. Now MySQL error message is included. Improved printout if text converter could not . . .
. . .
problems. Now MySQL error message is included. Improved printout if text converter could not extract words from PDF, DOC, XLS etc. files. Improved printout for Database Backup Management. Modified installation script. Thanks to Flemp. Font file renamed to captcha.tff (former: captcha.TTF). Thanks to ethix. All style sheets now are centralized . . .
. . .
/admin/real_get.php /admin/real_log.php /admin/real_ping.js /admin/spider.php /admin/spiderfuncs.php /converter/PDFtotext /converter/PDFtotext.script /include/captcha.tff /include/commonfuncs.php /include/searchfuncs.php /include/js_suggest/suggest.php /settings/conf.php /settings/database.php /templates/all_folders/navdown.jpg . . .
. . .
'If available follow sitemap.xml ' in order to prevent 'Page is duplicate ' messages. Improved printout if PDF files cause indexing problems. If 'Follow sitemap.xml' is activated and a valid sitemap was found, the log output Links found: 0 - New links: 0 is no longer shown. Because all links are delivered from the sitemap file and new . . .
. . .
Top [ Outdated versions ] Version 1.1 Build up with Sphider v.1.3.4.b Included converters for indexing PDF, DOC, RTF, XLS and PPT files. To be activated individually in Admin settings Warning message during index process when deactivated file was found Captcha protection for Submission Form 'Suggest a new Site' . Use of Captcha to be . . .

2.   Sphider-plus - The PHP Search Engine

6.9 Block queries 7. Chronological order for result listing 7.1 Text result listing 7.2 Media result listing 8. PDF converter 9. Clean resources during index / re-index. 10. Enable real-time output of logging data 11. Error messages and Debug mode 12. Delete secondary characters 13. Media search for images, audio streams and videos 13.1 Media . . .
. . .
and time of flood attempt. - Auto Re-index log file - Server info offering: Server software, environment, MySQL, PDF-converter, image functions, php.ini file PHP integration, PHP security info. Each item holding lists of details. All text links, media links and thumbnails are active linked. As stated in chapter Introduction , this search engine . . .
. . .
whether you like to use this web service for all pages to be indexed. More details are described in the readme.PDF documentation. 2.8 Prevent indexing of known malware and phishing pages This feature is a service, provided by Google that enables applications to check Internet URLs against Google's constantly updated lists of suspected . . .
. . .
additionally requires an individual key. To be signed up at Google. More details are described in the readme.PDF documentation. 2.9 Follow Sitemap file To be activated in Admin settings, Sphider-plus will use the links found in sitemap.xml or sitemap.xml.gz files. This significantly increases the speed for index and re-index, because the . . .
. . .
is normally presented as part of the HTML header. If not available, or for files without header like .doc, .rtf, .PDF, .xls and .ppt files, the 'Preferred charset' (as defined in Admin settings) will be used to convert the file into Unicode. In other words: it is not possible to convert DOCs, PDFs etc. that are coded in 'foreign' charset. Only . . .
. . .
personal charset will be converted correctly. Also it is not possible to convert a Chinese and a Cyrillic coded PDF document at the same time. It is necessary to adapt the 'Preferred charset' before invoking the index procedure for the sites and their links to these documents. 2. By means of the PHP function 'iconv()' all texts will be . . .
. . .
all results from .html pages are shown first, all results from .php pages second, and finally all results from PDF documents at the end of the result listing. Controlled by the file /include/common/file_suffix.txt which contains a list of suffixes to be expected in result listing. The order of all suffixes in this list determines the . . .
. . .
definable: - By title (alphabetic) - By file suffix - By image size - By 'Last queried' - By 'Most popular' Top 8. PDF converter Starting with version 4 a new PDF converter is implemented in Sphider-plus. Realized as a pure PHP script, the new converter no longer requires defining an individual path. The new converter indexes text and images . . .
. . .
no longer requires defining an individual path. The new converter indexes text and images in not encoded PDFs. Top 9. Clean resources during index / re-index. In order to prevent performance problems and memory overflow for a large number of URLs, Sphider-plus may clean unused resources during index / re-index. Selectable in Admin . . .
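
The suffix-based ordering mentioned in this excerpt (results from .html pages first, then .php, then PDF documents) can be illustrated with a minimal sketch. It assumes the suffix list has already been read into a Python list; the actual format of file_suffix.txt and the way Sphider-plus applies it are not shown in the excerpt, so the function names and sample URLs below are hypothetical.

```python
from urllib.parse import urlparse
import posixpath


def suffix_of(url):
    """Return the lower-cased file suffix of a URL path, without the dot."""
    path = urlparse(url).path
    return posixpath.splitext(path)[1].lstrip(".").lower()


def sort_by_suffix(urls, suffix_order):
    """Order URLs by the position of their suffix in suffix_order.

    Suffixes that are not listed sort last. sorted() is stable, so the
    original relative order is kept inside each suffix group.
    """
    rank = {s.lower(): i for i, s in enumerate(suffix_order)}
    unknown = len(rank)
    return sorted(urls, key=lambda u: rank.get(suffix_of(u), unknown))


# Hypothetical suffix list and result URLs, for illustration only.
suffix_order = ["html", "php", "pdf"]
results = [
    "http://example.com/doc.pdf",
    "http://example.com/index.php",
    "http://example.com/a.html",
    "http://example.com/b.pdf",
]
ordered = sort_by_suffix(results, suffix_order)
print(ordered)
```

Changing the order of the entries in suffix_order changes the order of the groups in the result listing, which matches the behaviour the excerpt describes.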

3.   Sphider-plus - The PHP Search Engine

6.9 Block queries 7. Chronological order for result listing 7.1 Text result listing 7.2 Media result listing 8. PDF converter 9. Clean resources during index / re-index. 10. Enable real-time output of logging data 11. Error messages and Debug mode 12. Delete secondary characters 13. Media search for images, audio streams and videos 13.1 Media . . .
. . .
and time of flood attempt. - Auto Re-index log file - Server info offering: Server software, environment, MySQL, PDF-converter, image functions, php.ini file PHP integration, PHP security info. Each item holding lists of details. All text links, media links and thumbnails are active linked. As stated in chapter Introduction , this search engine . . .
. . .
whether you like to use this web service for all pages to be indexed. More details are described in the readme.PDF documentation. 2.8 Prevent indexing of known malware and phishing pages This feature is a service, provided by Google that enables applications to check Internet URLs against Google's constantly updated lists of suspected phishing . . .
. . .
additionally requires an individual key. To be signed up at Google. More details are described in the readme.PDF documentation. 2.9 Follow Sitemap file To be activated in Admin settings, Sphider-plus will use the links found in sitemap.xml or sitemap.xml.gz files. This significantly increases the speed for index and re-index, because the . . .
. . .
is normally presented as part of the HTML header. If not available, or for files without header like .doc, .rtf, .PDF, .xls and .ppt files, the 'Preferred charset' (as defined in Admin settings) will be used to convert the file into Unicode. In other words: it is not possible to convert DOCs, PDFs etc. that are coded in 'foreign' charset. Only . . .
. . .
personal charset will be converted correctly. Also it is not possible to convert a Chinese and a Cyrillic coded PDF document at the same time. It is necessary to adapt the 'Preferred charset' before invoking the index procedure for the sites and their links to these documents. 2. By means of the PHP function 'iconv()' all texts will be . . .
. . .
all results from .html pages are shown first, all results from .php pages second, and finally all results from PDF documents at the end of the result listing. Controlled by the file /include/common/file_suffix.txt which contains a list of suffixes to be expected in result listing. The order of all suffixes in this list determines the . . .
. . .
definable: - By title (alphabetic) - By file suffix - By image size - By 'Last queried' - By 'Most popular' Top 8. PDF converter Starting with version 4 a new PDF converter is implemented in Sphider-plus. Realized as a pure PHP script, the new converter no longer requires defining an individual path. The new converter indexes text and images . . .
. . .
no longer requires defining an individual path. The new converter indexes text and images in not encoded PDFs. Top 9. Clean resources during index / re-index. In order to prevent performance problems and memory overflow for a large number of URLs, Sphider-plus may clean unused resources during index / re-index. Selectable in Admin . . .
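
The charset fallback described in this excerpt (use the charset from the file header if present, otherwise the 'Preferred charset') can be sketched as follows. The source names the PHP function iconv(); this sketch swaps in Python's bytes.decode as a stand-in, and the function name and parameters are assumptions made for illustration.

```python
def to_unicode(raw, declared_charset, preferred_charset):
    """Decode raw bytes to a Unicode string.

    Uses the charset declared in the document header when there is one,
    otherwise the preferred charset. If the declared name is unknown or
    the bytes do not fit it, falls back to the preferred charset and
    replaces undecodable bytes instead of failing.
    """
    charset = declared_charset or preferred_charset
    try:
        return raw.decode(charset)
    except (LookupError, UnicodeDecodeError):
        return raw.decode(preferred_charset, errors="replace")


# A Cyrillic text stored as Windows-1251 bytes without any header.
raw = "Привет".encode("cp1251")

good = to_unicode(raw, None, "cp1251")    # preferred charset matches the file
wrong = to_unicode(raw, None, "latin-1")  # preferred charset is 'foreign'
print(good, wrong)
```

With the wrong preferred charset the bytes still decode, but to unrelated characters. This is why the excerpt says a Chinese and a Cyrillic coded document cannot be converted in the same run: a header-less file carries no hint about which charset it uses.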

4.   Sphider-plus - The PHP Search Engine

New feature: Follow URL redirections caused by HTTP 301, 302, 303 and 307 status codes. New feature: Separated PDF converter supplied for 32 and 64 bit Operating Systems. For details, please notice chapter PDF converter for Linux/UNIX systems New feature: Follow links placed in JavaScript files. Will detect and follow links like . . .
. . .
Follow links placed in JavaScript files. Will detect and follow links like document.write(' <a href="new_12.PDF">All news 2012</a> '); Also the complete content of document.write(' this text in all rows'); will be indexed and stored as keywords in db. New feature: Now indexing also sites, which do send an obligatory request for a . . .
. . .
in Admin Settings, the charset will be extracted from the header of the files to be indexed. If not found, like in PDF documents, the preferred charset will be used. New option: Delete duplicate parts of the URL path found in the indexed page URL and the new links. Unfortunately some CMS seem to be unable to build up a correct path for relative . . .
. . .
/admin/spider.php /admin/spiderfuncs.php /admin/url_backup.php /converter/feed_parser.php /converter/PDFtotext32.script /converter/PDFtotext64.script /include/click_counter.php /include/commonfuncs.php /include/domain_whois.php /include/idna_converter.php /include/media_counter.php /include/search_10.php /include/search_40.php . . .
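
How links inside document.write(...) calls might be detected, as mentioned in this result, can be sketched with two regular expressions. This is only an illustration under simplifying assumptions (one string literal per call, no escaped quotes inside it); it is not the code Sphider-plus uses, and the function name is hypothetical.

```python
import re

# Matches document.write('...') or document.write("...") with one string literal.
DOC_WRITE = re.compile(r"document\.write\(\s*(['\"])(.*?)\1\s*\)", re.DOTALL)
# Matches href="..." or href='...' inside the written markup.
HREF = re.compile(r"href\s*=\s*[\"']([^\"']+)[\"']", re.IGNORECASE)


def links_in_document_write(js_source):
    """Return all href targets found inside document.write string literals."""
    links = []
    for match in DOC_WRITE.finditer(js_source):
        written_markup = match.group(2)
        links.extend(HREF.findall(written_markup))
    return links


js = """document.write(' <a href="new_12.PDF">All news 2012</a> ');"""
found = links_in_document_write(js)
print(found)
```

The text captured in group 2 is also what a crawler could hand to its keyword indexer, which corresponds to the second part of the feature description.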

5.   Sphider-plus - The PHP Search Engine

of the word exceeding the defined length. Not inside a word at the character count limit defined in Admin setting. PDF converter for LINUX/UNIX Operating Systems included. Needs to be individualized according to readme.PDF documentation, chapter 'PDF converter for Linux server.' Thanks to rasc. Additional item in Admin section: Server Info To be . . .
. . .
To be found in submenu 'Statistics', important information is presented for: - Server - Environment - MySQL - PDF converter - php.ini file - PHP integration Enlarged Admin interface if database is empty. Improved printout for database connection problems. Now MySQL error message is included. Improved printout if text converter could not . . .
. . .
problems. Now MySQL error message is included. Improved printout if text converter could not extract words from PDF, DOC, XLS etc. files. Improved printout for Database Backup Management. Modified installation script. Thanks to Flemp. Font file renamed to captcha.tff (former: captcha.TTF). Thanks to ethix. All style sheets now are centralized . . .
. . .
/admin/real_get.php /admin/real_log.php /admin/real_ping.js /admin/spider.php /admin/spiderfuncs.php /converter/PDFtotext /converter/PDFtotext.script /include/captcha.tff /include/commonfuncs.php /include/searchfuncs.php /include/js_suggest/suggest.php /settings/conf.php /settings/database.php /templates/all_folders/navdown.jpg . . .

6.   Sphider-plus - The PHP Search Engine

notice chapter Canonical <link> tag Index websites that are created with ASP.NET Definition for path to PDF converter integrated into Admin Settings interface. Additionally the default setting - as required for the Operating System environment - is suggested. If path to PDF converter is invalid and converter is not accessible, an error . . .

7.   Sphider-plus - The PHP Search Engine

description New Admin setting: Index ZIP compressed files and archives. Supports (X)HTML, XML and also compressed PDFs and other document files, as well as all kind of feeds, frames and iframes. Links found in the compressed files will be followed. New option to sort the result listing: Sort by last indexed (date and time). To be defined in . . .
. . .
Malfunction will cause warning messages. Self test for up to date table structure of MySQL database. Self test of PDF converter for correct addressing the converter and correct conversion of a test-file. Failures and malfunction will cause warning messages. Updated PDF converter for non-Latin text like Arabic, Cyrillic, Chinese, Greek and . . .
. . .
/admin/install_tables.php /admin/messages.php /admin/spider.php /admin/spiderfuncs.php /converter/dummy.PDF /converter/PDFtotext.exe /converter/PDFtotext.script /converter/xPDFrc /include/click_counter.php /include/commonfuncs.php /include/make_captcha.php /include/media_counter.php /include/searchfuncs.php . . .

8.   Sphider-plus - The PHP Search Engine

to allocate yyy bytes)" Fatal error: "Uncaught Error: Class 'ZipArchive' not found . . ." Fatal error: Uncaught PdfToTextDecodingException: PDF decoding error. PDF: Only document titles are indexed. PHP security info is not presented in Admin Statistics. What kind of input validation is performed (vulnerability)? How to protect Database . . .
. . .
bytes Increase the value for 'max_allowed_packet' in your SQL server, e.g. to 16.777.216 (16 MB), for large text like in PDFs. The max allowed value is limited to 1.073.741.824 (1GB). The definition to be increased is to be found in /mysql/bin/my.ini Here you may define max_allowed_packet=16M Afterwards you need to restart your server. Top . . .
. . .
unlocked by the admin of Sphider-plus. How? Will not be discussed here on the Internet. Explained in the 'readme.PDF' documentation. Top Error message: Up to now, there is no valid database defined for user access In some rare cases of fresh Sphider-plus installations, the assignment of a database does not work properly. If this error message . . .
. . .
not granted to this admin backend. The solution for this error message is presented in chapter 22.4 of the readme.PDF docu, and will not be described here. Top Warning message: At present 'Site' options are not available. Please wait until the currently running indexation finished Usually it is no problem to abort the index procedure during . . .
. . .
directive: error_reporting = E_ALL & ~E_DEPRECATED & ~E_WARNING & ~E_NOTICE & ~E_STRICT; Top Fatal error: Uncaught PdfToTextDecodingException: PDF decoding error . . . In your PHP environment of your server find the script /php.ini Open this script in your editor and uncomment the following 2 rows. Also increase the values of the 2 variables: . . .
. . .
of the 2 variables: .backtrack_limit=400000000 and .recursion_limit=40000000 Afterwards restart your server. Top PDF: Only document titles are indexed Are you sure it is a PDF document containing text? Or might it be an image, converted to PDF, which could not be indexed by Sphider-plus? In order to check it out: Open the document in your PDF . . .
. . .
the complete text is marked, and not only one word, the document is an image, which was just converted into PDF. PHP security info is not presented in Admin Statistics Unfortunately not all servers are supporting this feature. They take their security settings as a secret. A 'blank' admin is the typical response. As a consequence, this . . .
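
The byte values quoted in this result for max_allowed_packet follow from binary size suffixes: 16M is 16 x 1024 x 1024 = 16.777.216 bytes, and the 1GB ceiling is 1024 x 1024 x 1024 = 1.073.741.824 bytes. A small sketch of that arithmetic is given below; the helper names are hypothetical and the check only compares lengths, it does not talk to a MySQL server.

```python
# Binary multipliers for the size suffixes K, M and G.
UNITS = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}


def parse_size(value):
    """Convert a setting such as '16M' or '1G' (or a plain number) to bytes."""
    value = value.strip().upper()
    if value and value[-1] in UNITS:
        return int(value[:-1]) * UNITS[value[-1]]
    return int(value)


def fits_in_packet(payload, max_allowed_packet):
    """Return True if the payload is no larger than the configured packet size."""
    return len(payload) <= parse_size(max_allowed_packet)


sixteen_mb = parse_size("16M")
one_gb = parse_size("1G")
print(sixteen_mb, one_gb)

# Hypothetical example: 20 MB of extracted text does not fit into 16M.
large_text = b"x" * (20 * 1024 * 1024)
print(fits_in_packet(large_text, "16M"), fits_in_packet(large_text, "32M"))
```

A text extracted from a large document that exceeds the configured value is the situation in which the excerpt recommends raising the setting.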

9.   Sphider-plus - The PHP Search Engine

of document.write Will index JavaScript commands. Detect and follow links like: document.write(' <a href="new12.pdf">All news 2012</a> '); and index the content of: document.write(' this content '); Not indexing content created in real-time by JavaScript. Accept gzip formatted transmission In order to reduce the transfer time, gzip . . .
. . .
by the crawler. Index of RAR and ZIP compressed files and archives Supports compressed (X)HTML, XML and also PDFs, all kind of feeds, frames and iframes in archives. Links found in the compressed files are followed. Converter included for PDF, DOCX, XLSX, ODT, ODS, CSV, PPTX and XLS files Converting also non-Latin text like: Arabic, . . .

URL: http://sphider-plus.eu/ - 25.6 kb

10.   Sphider-plus - The PHP Search Engine

of document.write Will index JavaScript commands. Detect and follow links like: document.write(' <a href="new12.pdf">All news 2012</a> '); and index the content of: document.write(' this content '); Not indexing content created in real-time by JavaScript. Accept gzip formatted transmission In order to reduce the transfer time, gzip . . .
. . .
by the crawler. Index of RAR and ZIP compressed files and archives Supports compressed (X)HTML, XML and also PDFs, all kind of feeds, frames and iframes in archives. Links found in the compressed files are followed. Converter included for PDF, DOCX, XLSX, ODT, ODS, CSV, PPTX and XLS files Converting also non-Latin text like: Arabic, . . .

Most popular queries

Query Count Results Last queried
sphider 5 63 2024-04-19 11:04:35
cookies 4 2 2024-04-19 16:49:30
debug 2 14 2024-04-18 21:31:57
germany 2 1 2024-04-17 16:46:21
hold 2 3 2024-04-19 12:29:47
