diff options
-rw-r--r-- | README.md | 4 | ||||
-rw-r--r-- | xml2tsv.1 | 67 |
2 files changed, 69 insertions, 2 deletions
@@ -45,8 +45,8 @@ Customisation ============= It is possible to customise several aspects of the output by changing -values in `config.h`. Please see the comments within that file for more -details. +values in `config.h`. Please see the comments within that file or read +the manpage for more details. Copying ======= diff --git a/xml2tsv.1 b/xml2tsv.1 new file mode 100644 index 0000000..f071e2a --- /dev/null +++ b/xml2tsv.1 @@ -0,0 +1,67 @@ +.TH XML2TSV 1 "04/01/2020" "" "" +.SH NAME +xml2tsv \- simple xml-to-tsv converter +.SH SYNOPSIS +.PP +xml2tsv +.PP +.SH DESCRIPTION +.PP +xml2tsv is a simple tool to convert XML given on standard input to a list of +tab-separated-values records. Each XML entry is converted to a line +like: +.EX + + /full/path/to/current/entry[TAB]attr1=value1[TAB]attr2=value2[TAB]...[TAB]data[NEWLINE] + +.EE +where +.I "/full/path/to/current/entry" +represents the full hierarchy of entries down to the current one. For +instance, the XML snippet: +.EX + + <html> + <head> + <title>This is a title</title> + </head> + <body> + <h1>It works!</h1> + <a href="https://my.wonderful.website.net">Click here</a> + </body> + </html> + +.EE +will produce the output: +.EX + + /html + /html/head + /html/head/title This is a title + /html/body + /html/body/h1 It works! + /html/body/a href=https://my.wonderful.website.net Click here + +.EE +By default, xml2tsv quotes '\\n', '\\t', and '\\', and strips other +control characters. +.SH CONFIGURATION +The maximum length of an entry name and the maximum depth of an entry +are fixed to STR_MAX and DEPTH_MAX, and can be changed by editing the +file +.BI config.h +and rebuilduing xml2tsv. It is also possible to change the separator +used on output (SEP, by default set to '\\t'), and the character used to +separate the name of an attribute from its value (SATTR, by default set +to '='). +.SH BUGS + +xml2tsv currently lacks an option to force printing control characters +on output, if desired. + +.SH AUTHORS +xml2tsv is written and maintained by Vincenzo "KatolaZ" Nicosia +<katolaz@freaknet.org>. The code is based on +.BI xmlparser +by Hiltjo Posthuma <hiltjo@codemadness.org>. You can use, distribute, +modify, and redistribute xml2tsv under the terms of the ISC License. |