path: root/preconv.c
Commit message | Author | Age | Files | Lines
* Some small lint checks in preconv. Also add it to the default lint rule.
  Kristaps Dzonsons, 2011-05-26, 1 file, -5/+5
* preconv is now on encoding-recognition parity with groff. This last commit adds parsing of "File Variables" in the first two lines in order to grok the encoding. This completes groff's recognition sequence (-e, BOM, File variables, -D, default). I've also cleaned up the manual to indicate this and for some general readability. preconv is now compiled by default in the Makefile.
  Kristaps Dzonsons, 2011-05-26, 1 file, -2/+104
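The recognition sequence named in the entry above (-e, BOM, File variables, -D, default) can be sketched roughly as below. This is an illustration only, not code from preconv.c: the function names `has_utf8_bom` and `pick_encoding` are hypothetical, the file-variable value is assumed to have been parsed elsewhere, and the "latin-1" fallback is an assumption, since the commit message does not say what the default is.

```c
#include <stddef.h>
#include <string.h>

/* Return 1 if the buffer starts with the UTF-8 byte-order mark EF BB BF. */
static int
has_utf8_bom(const unsigned char *buf, size_t len)
{
	static const unsigned char bom[3] = { 0xEF, 0xBB, 0xBF };

	return len >= 3 && memcmp(buf, bom, 3) == 0;
}

/*
 * Pick an encoding name in the order given in the commit message:
 * -e, BOM, file variables, -D, built-in default.
 * A NULL argument means "this source gave no answer".
 * (Hypothetical helper; names and signature are not from preconv.c.)
 */
static const char *
pick_encoding(const char *opt_e, int utf8_bom,
    const char *filevar, const char *opt_D)
{
	if (opt_e != NULL)
		return opt_e;
	if (utf8_bom)
		return "utf-8";
	if (filevar != NULL)
		return filevar;
	if (opt_D != NULL)
		return opt_D;
	return "latin-1";	/* assumed fallback, not stated in the source */
}
```

Each source is consulted only when every earlier one gave no answer, so an explicit -e always wins and -D only matters for files with neither a BOM nor file variables.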
* Significantly improve preconv. Allow it to recode UTF-8 characters into the \[uNNNN] strings (taking into account big-endian archs). Also allow it to determine from the BOM whether it's a UTF-8 file. Also add the initial manual. This has been tested over a random selection of UTF-8 documents, as
      % preconv -e utf-8 foo.1 | ./mandoc -Tlocale
  where -Tlocale is allowed (-DUSE_WCHAR). Note that we're still missing the "type" indicator that preconv accepts.
  Kristaps Dzonsons, 2011-05-26, 1 file, -12/+116
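To make the UTF-8 to \[uNNNN] recoding in the entry above concrete, here is a minimal sketch. It is not the code from preconv.c: `utf8_decode` and `recode_utf8` are hypothetical names, and the decoder uses shifts and masks on individual bytes, so host byte order does not matter here, which may well differ from how the committed code deals with big-endian machines. Overlong forms and surrogates are not rejected.

```c
#include <stddef.h>
#include <stdio.h>

/*
 * Decode one UTF-8 sequence at in[0..len). Store the code point in *cp and
 * return the number of bytes consumed, or 0 if the sequence is malformed.
 */
static size_t
utf8_decode(const unsigned char *in, size_t len, unsigned long *cp)
{
	size_t i, n;

	if (len == 0)
		return 0;
	if (in[0] < 0x80) {
		*cp = in[0];
		return 1;
	} else if ((in[0] & 0xE0) == 0xC0) {
		*cp = in[0] & 0x1F;
		n = 2;
	} else if ((in[0] & 0xF0) == 0xE0) {
		*cp = in[0] & 0x0F;
		n = 3;
	} else if ((in[0] & 0xF8) == 0xF0) {
		*cp = in[0] & 0x07;
		n = 4;
	} else
		return 0;

	if (len < n)
		return 0;
	for (i = 1; i < n; i++) {
		if ((in[i] & 0xC0) != 0x80)
			return 0;	/* not a continuation byte */
		*cp = (*cp << 6) | (in[i] & 0x3F);
	}
	return n;
}

/*
 * Copy ASCII bytes through and rewrite every other character as \[uNNNN]
 * (at least four uppercase hex digits). Returns the output length, or -1
 * on malformed input or when the output buffer is too small.
 */
static int
recode_utf8(const unsigned char *in, size_t len, char *out, size_t outsz)
{
	unsigned long cp;
	size_t n, used = 0;
	int w;

	if (outsz == 0)
		return -1;
	out[0] = '\0';
	while (len > 0) {
		if ((n = utf8_decode(in, len, &cp)) == 0)
			return -1;
		if (cp < 0x80)
			w = snprintf(out + used, outsz - used, "%c", (int)cp);
		else
			w = snprintf(out + used, outsz - used, "\\[u%04lX]", cp);
		if (w < 0 || (size_t)w >= outsz - used)
			return -1;
		used += (size_t)w;
		in += n;
		len -= n;
	}
	return (int)used;
}
```

With this sketch, the six input bytes of "na\xc3\xafve" come out as the twelve characters `na\[u00EF]ve`, and a truncated sequence such as a lone 0xC3 byte is reported as an error instead of being passed through.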
* It's annoying that we don't have preconv, so throw together a quick version and let it grow in-tree. Right now, this only supports the Latin-1 and US-ASCII encodings. I'll do UTF-8 next. It's call-compatible with GNU's preconv although I don't do fancy stuff like BOM or header check. This will come. I used read.c's file-grokking code.
  Kristaps Dzonsons, 2011-05-26, 1 file, -0/+316