-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Format: 1.7 Date: Tue, 08 Jan 2008 05:13:31 +0200 Source: pandoc Binary: pandoc-doc libghc6-pandoc-dev pandoc Architecture: source all i386 Version: 0.46 Distribution: unstable Urgency: low Maintainer: Recai Oktaş <roktas@debian.org> Changed-By: Recai Oktaş <roktas@debian.org> Description: libghc6-pandoc-dev - general markup converter pandoc - general markup converter pandoc-doc - general markup converter Changes: pandoc (0.46) unstable; urgency=low . [ John MacFarlane ] . * Made -H, -A, and -B options cumulative: if they are specified multiple times, multiple files will be included. . * Added optional HTML sanitization using a whitelist. When this option is specified (--sanitize-html on the command line), unsafe HTML tags will be replaced by HTML comments, and unsafe HTML attributes will be removed. This option should be especially useful for those who want to use pandoc libraries in web applications, where users will provide the input. . + Main.hs: Added --sanitize-html option. . + Text.Pandoc.Shared: Added stateSanitizeHTML to ParserState. . + Text.Pandoc.Readers.HTML: - Added whitelists of sanitaryTags and sanitaryAttributes. - Added parsers to check these lists (and state) to see if a given tag or attribute should be counted unsafe. - Modified anyHtmlTag and anyHtmlEndTag to replace unsafe tags with comments. - Modified htmlAttribute to remove unsafe attributes. - Modified htmlScript and htmlStyle to remove these elements if unsafe. . + Modified README and man pages to document new option. . * Improved handling of email addresses in markdown and reStructuredText. Consolidated uri and email address parsers. (Resolves Issue #37.) . + New emailAddress and uri parsers in Text.Pandoc.Shared. - uri parser uses parseURI from Network.URI. - emailAddress parser properly handles email addresses with periods in them. . + Removed uri and emailAddress parsers from Text.Pandoc.Readers.RST and Text.Pandoc.Readers.Markdown. . * Markdown reader: . + Fixed emph parser so that "*hi **there***" is parsed as a Strong nested in an Emph. (A '*' is only recognized as the end of the emphasis if it's not the beginning of a strong emphasis.) . + Moved blockQuote parser before list parsers for performance. . + Modified 'source' parser to allow backslash-escapes in URLs. So, for example, [my](/url\(1\)) yields a link to /url(1). Resolves Issue #34. . + Disallowed links within links. (Resolves Issue #35.) - Replaced inlinesInBalanced with inlinesInBalancedBrackets, which instead of hard-coding the inline parser takes an inline parser as a parameter. - Modified reference and inlineNote to use inlinesInBalancedBrackets. - Removed unneeded inlineString function. - Added inlineNonLink parser, which is now used in the definition of reference. - Added inlineParsers list and redefined inline and inlineNonLink parsers in terms of it. - Added failIfLink parser. . + Better handling of parentheses in URLs and quotation marks in titles. - 'source' parser first tries to parse URL with balanced parentheses; if that doesn't work, it tries to parse everything beginning with '(' and ending with ')'. - source parser now uses an auxiliary function source'. - linkTitle parser simplified and improved, under assumption that it will be called in context of source'. . + Make 'block' conditional on strictness state, instead of using failIfStrict in block parsers. Use a different ordering of parsers in strict mode (raw HTML block before paragraph) for performance. In non-strict mode use rawHtmlBlocks instead of htmlBlock. Simplified htmlBlock, since we know it's only called in strict mode. . + Improved handling of raw HTML. (Resolves Issue #36.) - Tags that can be either block or inline (e.g. <ins>) should be treated as block when appropriate and as inline when appropriate. Thus, for example, <ins>hi</ins> should be treated as a paragraph with inline <ins> tags, while <ins> hi </ins> should be treated as a paragraph within <ins> tags. - Moved htmlBlock after para in list of block parsers. This ensures that tags that can be either block or inline get parsed as inline when appropriate. - Modified rawHtmlInline' so that block elements aren't treated as inline. - Modified para parser so that paragraphs containing only HTML tags and blank space are not allowed. Treat these as raw HTML blocks instead. . + Fixed bug wherein HTML preceding a code block could cause it to be parsed as a paragraph. The problem is that the HTML parser used to eat all blank space after an HTML block, including the indentation of the code block. (Resolves Issue #39.) - In Text.Pandoc.Readers.HTML, removed parsing of following space from rawHtmlBlock. - In Text.Pandoc.Readers.Markdown, modified rawHtmlBlocks so that indentation is eaten *only* on the first line after the HTML block. This means that in <div> foo <div> the foo won't be treated as a code block, but in <div> . foo . </div> it will. This seems the right approach for least surprise. . * RST reader: . + Fixed bug in parsing explicit links (resolves Issue #44). The problem was that we were looking for inlines until a '<' character signaled the start of the URL; so, if you hit a reference-style link, it would keep looking til the end of the document. Fix: change inline => (notFollowedBy (char '`') >> inline). Note that this won't allow code inlines in links, but these aren't allowed in resT anyway. . + Cleaned up parsing of reference names in key blocks and links. Allow nonquoted reference links to contain isolated '.', '-', '_', so so that strings like 'a_b_' count as links. . + Removed unnecessary check for following link in str. This is unnecessary now that link is above str in the definition of 'inline'. . * HTML reader: . + Modified rawHtmlBlock so it parses </html> and </body> tags. This allows these tags to be handled correctly in Markdown. HTML reader now uses rawHtmlBlock', which excludes </html> and </body>, since these are handled in parseHtml. (Resolves Issue #38.) . + Fixed bug (emph parser was looking for <IT> tag, not <I>). . + Don't interpret contents of style tags as markdown. (Resolves Issue #40.) - Added htmlStyle, analagous to htmlScript. - Use htmlStyle in htmlBlockElement and rawHtmlInline. - Moved "script" from the list of tags that can be either block or inline to the list of block tags. . + Modified rawHtmlBlock to use anyHtmlBlockTag instead of anyHtmlTag and anyHtmlEndTag. This fixes a bug in markdown parsing, where inline tags would be included in raw HTML blocks. . + Modified anyHtmlBlockTag to test for (not inline) rather than directly for block. This allows us to handle e.g. docbook in the markdown reader. . * LaTeX reader: Properly recognize --parse-raw in rawLaTeXInline. Updated LaTeX reader test to use --parse-raw. . * HTML writer: . + Modified rules for automatic HTML header identifiers to ensure that identifiers begin with an alphabetic character. The new rules are described in README. (Resolves Issue #33.) . + Changed handling of titles in HTML writer so you don't get "titleprefix - " followed by nothing. . * ConTeXt writer: Use wrappers around Doc elements to ensure proper spacing. Each block element is wrapped with either Pad or Reg. Pad'ed elements are guaranteed to have a blank line in between. . * RST writer: . + Refactored RST writer to use a record instead of a tuple for state, and to include options in state so it doesn't need to be passed as a parameter. . + Use an interpreted text role to render math in restructuredText. See http://www.american.edu/econ/itex2mml/mathhack.rst for the strategy. . [ Recai Oktaş ] . * Debian packaging changes: . + Remove the empty 'include' directory in -dev package, which lintian complains about. + Bump Standarts-Version to 3.7.3. + Use new 'Homepage:' field to specify the upstream URL on suggestion of lintian. Files: d215193f7822c5ff8770c980f52a6927 848 text optional pandoc_0.46.dsc b218ee905bc40d2b7ebb8b23a87a6efb 349904 text optional pandoc_0.46.tar.gz d1f97997345bd71abd3d5961aa8a39d1 59194 doc optional pandoc-doc_0.46_all.deb 2304adf69e7e90194924ac96a317508b 642404 text optional pandoc_0.46_i386.deb 401a140f8242ce473636a9613d13e5d2 1297368 libdevel optional libghc6-pandoc-dev_0.46_i386.deb -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) iD8DBQFHg7wrnA44mz/SXIQRAk/KAJ9eWo1G+zufz78NdIoFuuwkEiFWDACcCHf4 le6FyRufdE3xYbfnfEuN8xw= =8hIj -----END PGP SIGNATURE----- Accepted: libghc6-pandoc-dev_0.46_i386.deb to pool/main/p/pandoc/libghc6-pandoc-dev_0.46_i386.deb pandoc-doc_0.46_all.deb to pool/main/p/pandoc/pandoc-doc_0.46_all.deb pandoc_0.46.dsc to pool/main/p/pandoc/pandoc_0.46.dsc pandoc_0.46.tar.gz to pool/main/p/pandoc/pandoc_0.46.tar.gz pandoc_0.46_i386.deb to pool/main/p/pandoc/pandoc_0.46_i386.deb