Change History
Releases
- Version 0.6.4 (15 Jun 2002)
[zip]
[tgz]
- Re-organized package contents for integration into the CyberNeko
Tools for XNI package;
fixed table closing bug reported by Oskar Liljeblad;
fixed newline bug reported by OtisG;
and
fixed line counting bug reported by Donald Ball.
- Version 0.6.3 (29 May 2002)
[zip]
[tgz]
- Fixed bug in handling of <th> elements reported by
Oskar Liljeblad;
and
fixed various tag-balancing problems.
- Version 0.6.2 (26 May 2002)
[zip]
[tgz]
- Changed scanner behavior as requested by Alexey Shananin to
report malformed start elements (e.g. <...>) as
characters
and
fixed tag balancing bug introduced in previous version. Oops!
- Version 0.6.1 (23 May 2002)
[zip]
[tgz]
- Changed tag balancer behavior to swallow events after the close
of the <html> tag to ensure that the document stream
remains well-formed;
added additional Ruby elements;
and
improved tag balancer performance.
- Version 0.6 (12 May 2002)
[zip]
[tgz]
- Added property to allow custom document filters to be appended
to the default NekoHTML parser pipeline;
added convenience filters for serializing HTML documents and
removing elements from the document event stream;
added samples to demonstrate the filtering feature;
added experimental functionality to
allow applications to dynamically insert content into the
HTML document stream;
added a minimal Xerces2 Jar file containing just the files
required for using the HTMLConfiguration class directly to
alleviate full dependence on Xerces2 distribution;
applied patch from Serge Proskuryakov to fix handling of
misplaced <title> within <body>;
fixed minor tag balancing bug;
and
re-organized and added new documentation.
- Version 0.5 (07 May 2002)
[zip]
[tgz]
- Fixed some location reporting information bugs and added
feature to report character boundaries of events via the
associated augmentations object;
added feature to disable tag balancing;
and
added features to notify handlers of start and end of character
and built-in XML and HTML entity references.
- Version 0.4.1 (03 May 2002)
[zip]
[tgz]
- Fixed some unquoted attribute value scanning bugs reported
by Xiaowei Jiang;
fixed hack for Xerces 2.0.1 reported by Ron Cemer;
now passing locator object to
startDocument
method;
and
celebrated opening of the Spider-Man movie.
- Version 0.4 (14 Apr 2002)
[zip]
[tgz]
- Added properties to control case of element and attribute names;
changed behavior of parser so that only known HTML elements
have their names modified according to the properties — all
unknown tags are left as-is;
added property to set default encoding;
added feature to augment infoset to report "synthesized" events;
added feature to be able to report errors and localized the error
messages;
implemented the locator so that location information can be
reported;
and
fixed element information so that more elements are properly
scanned as "special".
- Version 0.3.3 (02 Apr 2002)
[zip]
[tgz]
- Separated META-INF/services/* files to separate Jar
so that HTML parser configuration selection can be controlled
more explicitly; added DOM and SAX parser classes for
convenience; and fixed bug so that parser now obeys the
encoding specified in the input source.
- Version 0.3.2 (15 Mar 2002)
[zip]
[tgz]
- Fixed problem with bare <input> elements appearing outside
of <form> tag.
- Version 0.3.1 (07 Mar 2002)
[zip]
[tgz]
- Fixed handling of bare ampersands in content and attribute
values.
- Version 0.3 (25 Feb 2002)
[zip]
[tgz]
- Changed license to an Apache style license and fixed a
few bugs.
- Version 0.2.3 (19 Feb 2002)
[zip]
[tgz]
- Nested tables bug fix.
- Version 0.2.2 (17 Feb 2002)
[zip]
[tgz]
- More bug fixes to allow the parser to be used with Xalan
2.3.0. The parser wasn't keeping track of features and
properties and without namespaces turned on, Xalan would
not correctly transform the SAX events emitted using
NekoHTML.
- Version 0.2.1 (16 Feb 2002)
[zip]
[tgz]
- Minor bug fix to work around problem in Xerces 2.0.0 SAX
parser that drops attributes when parser configuration
doesn't have a symbol table.
- Version 0.2 (14 Feb 2002)
[zip]
[tgz]
- Adding support for UTF-8, UTF-16, and other 8-bit encodings
supported by Java.
- Version 0.1 (04 Feb 2002)
[zip]
[tgz]
- Initial writing.
(C) Copyright 2002, Andy Clark. All rights reserved.