Merge branch 'patches/bug10020' into 8.0 * patches/bug10020: Search.Filter.HTML: Fix title always being set to "".
Merge branch 'patches/bug10020' * patches/bug10020: Search.Filter.HTML: Fix title always being set to "".
Search.Filter.HTML: Fix title always being set to "". Fixes #10020.
Merge branch 'patches/bug10020' into 8.0 * patches/bug10020: Search.Filter.HTML: Return the first <title>.
Merge branch 'patches/bug10020' * patches/bug10020: Search.Filter.HTML: Return the first <title>.
Search.Filter.HTML: Return the first <title>. There may be multiple <title>-tags in an html document where the later are related to SVG. Fixes #10020.
Merge commit '722771973bd' into patches/lyslyskom22891031 * commit '722771973bd': (6177 commits) Verify that callablep responses are aligned with reality. ...
Merge commit '2470270f500c728d10b8895314d8d8b07016e37b' into grubba/typechecker-automap * commit '2470270f500c728d10b8895314d8d8b07016e37b': (18681 commits) Removed the old typechecker. ...
Merge remote-tracking branch 'origin/8.1' into gobject-introspection
Merge commit '75c9d1806f1a69ca21c27a2c2fe1b4a6ea38e77e' into patches/pike63 * commit '75c9d1806f1a69ca21c27a2c2fe1b4a6ea38e77e': (19587 commits) ...
Search.Filter.HTML: More work arounds for Parser.HTML leak. This changes a few more callbacks to returning the empty array instead of returning zero. Work around for PIKE-14 (#8014).
Search.Filter.HTML: Work around for Parser.HTML leak. This changes a few callbacks to returning the empty array instead of returning zero. Work around for PIKE-14 (#8014).
Removed trailing spaces.
static -> protected
Removed some old compat code.
Ops. Didn't check in the last changes.
Removed some useless version checks. Only relevant if you would moce the code to an older Pike binary.
Removed $Id$.
Don't dump Search modules that depend on Roxen.
Reverted the move of Search.pmod from the Roxen module to Pike. It stays in the Roxen Search module for Roxen <= 4.5, for compatibility. The C-level part remains in Pike.
More Roxen -> Pike adaptions in Search.pmod.
Merge branch 'pike-modules/whitefish/4.5' into 7.4
Merge commit 'whitefish_split_4.5_5.0' into whitefish-fixup This is in preparation to fixup Search.Process as well.
Merge remote branch 'pike-modules/whitefish/5.0' into 7.9 Updated Search.Utils and Search.Process to a non-roxen context. Conflicts: .gitattributes .gitignore lib/modules/.autodoc
Merge branch 'pike-modules/whitefish/5.0' into 7.8 Updated Search.Utils and Search.Process to a non-roxen context. Conflicts: .gitattributes .gitignore lib/modules/.autodoc
Fixed support for Pike 7.4. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.46
Refactored handling of meta http-equviv. Now looks at the real headers as well. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.45
Updated copyright notices. Rev: lib/modules/Search.pmod/Database.pmod/Base.pike:1.7 Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.89 Rev: lib/modules/Search.pmod/Filter.pmod/Base.pike:1.18 Rev: lib/modules/Search.pmod/Filter.pmod/Charset.pmod:1.7 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.44 Rev: lib/modules/Search.pmod/Filter.pmod/PDF.pmod:1.17 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.10 Rev: lib/modules/Search.pmod/Filter.pmod/Word.pmod:1.13 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.8 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.14 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.9 Rev: lib/modules/Search.pmod/Grammar.pmod/debug.h:1.4 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.16 Rev: lib/modules/Search.pmod/Indexer.pmod:1.19 Rev: lib/modules/Search.pmod/Query.pmod:1.36 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.21 Rev: lib/modules/Search.pmod/ResultSet.pike:1.3 Rev: lib/modules/Search.pmod/Utils.pmod:1.49 Rev: lib/modules/Search.pmod/module.pmod:1.18 Rev: lib/modules/Search.pmod/search.h:1.2
Updated copyright notices. Rev: lib/modules/Search.pmod/Database.pmod/Base.pike:1.7 Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.88 Rev: lib/modules/Search.pmod/Filter.pmod/Base.pike:1.18 Rev: lib/modules/Search.pmod/Filter.pmod/Charset.pmod:1.7 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.44 Rev: lib/modules/Search.pmod/Filter.pmod/PDF.pmod:1.17 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.10 Rev: lib/modules/Search.pmod/Filter.pmod/Word.pmod:1.13 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.8 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.14 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.9 Rev: lib/modules/Search.pmod/Grammar.pmod/debug.h:1.4 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.16 Rev: lib/modules/Search.pmod/Indexer.pmod:1.19 Rev: lib/modules/Search.pmod/Query.pmod:1.36 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.20 Rev: lib/modules/Search.pmod/ResultSet.pike:1.3 Rev: lib/modules/Search.pmod/Utils.pmod:1.48 Rev: lib/modules/Search.pmod/module.pmod:1.18 Rev: lib/modules/Search.pmod/search.h:1.2
Extract all <meta /> tags from pages to allow indexing for any generic ones, instead of just a few special ones. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.43
<!-- robots:noindex --> didn't properly exclude some HTML elements such as <h1>-<h3> and <title>. Fixes [RT#9018]. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.42
HTML-entities shouldn't be present in hrefs added to linkfarms. Fixes [Bug 4280 (#4280)] Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.41
<!-- robots:nofollow -->...<!-- /robots:nofollow --> Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.40
More robust handling of missing <meta> attributes. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.39
Don't throw error when unknown entities are encountered in title or metadata fields. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.38
Added alternative to <noindex> tags: <!-- robots:noindex -->...<!-- /robots:noindex --> Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.37
Added alternative to <noindex> tags: <!-- robots:noindex -->...<!-- /robots:noindex --> Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.38
Merge from external js/per/nilsson fork Rev: lib/modules/Search.pmod/Database.pmod/Base.pike:1.5 Rev: lib/modules/Search.pmod/Database.pmod/Base.pmod:1.20(DEAD) Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.77 Rev: lib/modules/Search.pmod/Filter.pmod/Base.pike:1.16 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.37 Rev: lib/modules/Search.pmod/Filter.pmod/Output.pike:1.1 Rev: lib/modules/Search.pmod/Filter.pmod/PDF.pmod:1.16 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.9 Rev: lib/modules/Search.pmod/Filter.pmod/Word.pmod:1.12 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.7 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.11 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.7 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.12 Rev: lib/modules/Search.pmod/Indexer.pmod:1.17 Rev: lib/modules/Search.pmod/MergeFile.pike:1.1 Rev: lib/modules/Search.pmod/Process.pmod:1.12 Rev: lib/modules/Search.pmod/Query.pmod:1.26 Rev: src/modules/_WhiteFish/Makefile.in:1.7 Rev: src/modules/_WhiteFish/blob.c:1.34 Rev: src/modules/_WhiteFish/blobs.c:1.12 Rev: src/modules/_WhiteFish/buffer.c:1.13 Rev: src/modules/_WhiteFish/linkfarm.c:1.5 Rev: src/modules/_WhiteFish/resultset.c:1.24 Rev: src/modules/_WhiteFish/resultset.h:1.9 Rev: src/modules/_WhiteFish/whitefish.c:1.35
Don't throw error when unknown entities are encountered. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.36
Fixed html comment problem in indexer. The Indexer does now handle html-comments with single quote characters. Before nothing below sutch a comment was ignored by the indexer. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.35
Oops, fixed typo. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.34
HTML-entities shouldn't be present in hrefs added to linkfarms... Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.33
Fixed a bug where names of unknown tags were indexed as text. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.32
Increase compatibility with e.g. <script> tags which may contain literal "<" in the tag data. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.31
Ported from 3.3: Fixed [Bug 3258 (#3258)]. Added support for multipple <meta name="robots">-tags. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.29
Fixed [Bug 3258 (#3258)]. Added support for multipple <meta name="robots">-tags. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.30
Don't index <a title="..."> when inside <noindex>. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.29
Word delimiters added for e.g. <a title="...">. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.28
Add spacing between separate text fragments to preserve word delimiters. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.24
Add spacing between separate text fragments to preserve word delimiters. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.27
Don't index alt texts Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.26
Fixed the summary Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.25
Added support for more intrawise metadata types Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.24
Removed debug Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.23
Now follows links found inside <no-index>..</no-index>, unless the nofollow attribute is specified. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.22
Fixed h1,h2,h3 parsing. Now also indexes alt attributes to img tags correctly. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.21
Lagt till HTML-taggarna embed och xml. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.20
Added a lot more knowledge about HTML Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.19
Index alt-texts and follow <frame[set]> tags Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.18
Handle <meta name="robots">. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.17
Use Parser.parse_html_entities and some other fixes. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.16
Added rank tag. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.15
Addded requiered fields Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.14
Adding copyright message Rev: lib/modules/Search.pmod/Database.pmod/Base.pmod:1.6 Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.34 Rev: lib/modules/Search.pmod/Filter.pmod/Base.pike:1.7 Rev: lib/modules/Search.pmod/Filter.pmod/Charset.pmod:1.4 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.13 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.5 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.5 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.7 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.5 Rev: lib/modules/Search.pmod/Grammar.pmod/debug.h:1.2 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.9 Rev: lib/modules/Search.pmod/Indexer.pmod:1.12 Rev: lib/modules/Search.pmod/Query.pmod:1.16 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.13 Rev: lib/modules/Search.pmod/ResultSet.pike:1.2 Rev: lib/modules/Search.pmod/Utils.pmod:1.5 Rev: lib/modules/Search.pmod/module.pmod:1.15
The Search module will be available in roxen thanks to the pike path finder. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.12 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.4 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.4 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.6 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.4 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.8 Rev: lib/modules/Search.pmod/Indexer.pmod:1.11 Rev: lib/modules/Search.pmod/Query.pmod:1.15 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.12 Rev: lib/modules/Search.pmod/module.pmod:1.14
Disabled the ignore_unknown flag, since it caused the filtered body text to be filled with all the tags from the input. Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.11
Not works without correct module_path Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.10 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.3 Rev: lib/modules/Search.pmod/Grammar.pmod/AbstractParser.pike:1.3 Rev: lib/modules/Search.pmod/Grammar.pmod/DefaultParser.pike:1.3 Rev: lib/modules/Search.pmod/Grammar.pmod/Lexer.pmod:1.2 Rev: lib/modules/Search.pmod/Grammar.pmod/module.pmod:1.4 Rev: lib/modules/Search.pmod/Indexer.pmod:1.10 Rev: lib/modules/Search.pmod/Query.pmod:1.11 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.9 Rev: lib/modules/Search.pmod/module.pmod:1.13
Marked lots of symbols static Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.9 Rev: lib/modules/Search.pmod/RankingProfile.pike:1.8 Rev: lib/modules/Search.pmod/Utils.pmod:1.4
some optimizations Rev: lib/modules/Search.pmod/Filter.pmod/Charset.pmod:1.2 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.8
Charset detection added Rev: lib/modules/Search.pmod/Filter.pmod/Base.pike:1.6 Rev: lib/modules/Search.pmod/Filter.pmod/Charset.pmod:1.1 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.7
Optimized 30% for common cases Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.6
Removed some debug Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.5
Use the C-level links mapping Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.4
Better looking debug printouts and threaded sync() Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.19 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.3 Rev: lib/modules/Search.pmod/Indexer.pmod:1.4
Major speed optimization overhaul Rev: lib/modules/Search.pmod/Database.pmod/MySQL.pike:1.18 Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.2 Rev: lib/modules/Search.pmod/Indexer.pmod:1.3 Rev: lib/modules/Search.pmod/Utils.pmod:1.2 Rev: src/modules/_WhiteFish/blob.c:1.20 Rev: src/modules/_WhiteFish/blob.h:1.10 Rev: src/modules/_WhiteFish/blobs.c:1.3
Moved to .pmods Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pike:1.6(DEAD) Rev: lib/modules/Search.pmod/Filter.pmod/HTML.pmod:1.1 Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pike:1.5(DEAD) Rev: lib/modules/Search.pmod/Filter.pmod/PlainText.pmod:1.1