corpus settings

Corpus switches control XML data for output to a local search engine.

corpus.article Prefer the content of <ARTICLE> when gathering corpus text.
corpus.body Prefer the content of <BODY> when gathering corpus text. This is the default.
corpus.main Prefer the content of <MAIN> when gathering corpus text.
corpus.output file
-d file
Dump XML corpus of site into file. This is intended for use by a local search engine. If none of --corpus.article, --corpus.body, or --corpus.main are specified, the content of <BODY> is used. If more than one are specified, then the text collected depends on a page’s content. This is incompatible with --shadow.update.
atom · config · corpus · CSS · env · general · HTML · JSON-LD · link · MathML · microformat · nits · ontology
output · RSL · RSS · settings · shadow · shell · site · spell · SSI · stats · SVG · templates · validate · VTT

build · introduction · releases · settings · usage · why