BOOK: a heuristic anomaly-detection script
From: Julian Foad <julianfoad_at_btopenworld.com>
Date: 2004-10-03 20:55:05 CEST
That's my grand name for a few "grep"s hacked together.
When I see or hear of an anomaly in the Subversion book source XML files, like unwanted spaces before </para> (thanks to Chris Pepper for pointing this one out), I always think there are likely to be more of them, and I see if I can find a way to detect them automatically. The attached BASH shell script "svn-book-anomalies.sh" is what I have hacked together over the last year or so. It employs horribly inexact methods of detection and of filtering out the XML mark-up from the real text - but it works well enough to find a few mistakes.
In particular, this finds several more spaces before </para>, and several places where double quote characters are used instead of the correct <quote>, and several occurrences of plain "svn" where it should be "Subversion", "<command>svn</command>", etc.
Please feel free to use it and/or improve it. If you can make it filter out the mark-up properly with 'xsltproc' or whatever, that would be great. Just don't check it in to the repository in its present, brittle, ugly form.
- Julian
---------------------------------------------------------------------
|
This is an archived mail posted to the Subversion Dev mailing list.
This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.