Skip to content

Latest commit

 

History

History
 
 

doctools

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Doctools

Tools we use to generate the Oils documentation. Some of this code is used to build the the blog as well.

See doc/doc-toolchain.md for details.

Tools shared with the blog:

  • cmark.py: Our wrapper around CommonMark.
  • spelling.py: spell checker
  • split_doc.py: Split "front matter" from Markdown.

More tools:

  • html_head.py: Common HTML fragments.
  • oil_doc.py: HTML filters.
  • help_gen.py: For doc/ref/index-{osh,ysh}.md.

Micro Syntax

  • src_tree.py is a fast and minimal source viewer.
  • It uses polyglot syntax analysis called "micro syntax". See micro-syntax.md.

TODO

Immediate:

  • iPhone CSS

    • font sizes are wrong when the line wraps
    • need to debug this directly
  • Experiment with shared library -- narrow waist that eliminates process overhead

    • main(argv)
    • stdin / stdout are buffers?
      • for the coprocess protocol, I wanted file descriptors. But this is a portable interface
    • returns status code
    • what about errors?
      • I think they can go to stderr to start
  • Ninja makes it faster

  • CLI syntax

    • modes: wc and cat
    • --col and --omit-col
    • mtax cat --no-comments --no-strs or --empty-strs
      • replace with spaces
      • reminds me of "garfield minus garfield"
  • output: 4 columns

    • which I guess are selectable
    • should tokens be binary data though?
  • Detect UTF-8

    • lexing doesn't work without UTF-8
  • Analyze TSV for function names in Python parser combinators

    • add () {} -- this is all you need really
      • oh and : for Python
  • C++ multi-line

    • comes up in _gen/bin/text_files.cc
    • this is an architecture issues, will allow Rust/Lua as well
  • Max color mode for debugging?

    • detail: blank lines in re2c blocks shouldn't be significant
      • I guess you detect whitespace in re2c blocks then
  • SLOC

  • Maybe add language for *.test.sh

    • the #### and ## lines are special

src-tree:

  • should README.md be inserted in index.html ?
    • probably, sourcehut has this too
    • use cmark
      • also use our TOC plugin
  • line counts in metrics/source-code.sh could link to src-tree
    • combine the CI jobs
    • should use micro_syntax --wc to count SLOC
      • Also micro_syntax --print --format ansi
        • this is the default?

Later:

  • Recast as TSV-Netstring, which is different than TSV8

    • select one of these cols:
      • Path, HTML, num_lines, num_sig_lines, and I guess ANSI?
        • line/tokens binary? You have line number and so forth
    • 3:foo\t 5:foo\n # last cell has to end with newline ?
      • preserves wc -l when data has no newlines?
      • or is this too easily confused with TSV8 itself? You don't want it to be valid TSV8?
      • it can start out as text
      • !tsv8 Str
      • !type
      • !nets-row ?
    • another thing you can do is have a prefix of cells
      • netstring is 3:foo,
      • prefix is 5; 3:foo,
        • that's a command to read 5 net strings I guess?
  • Parsing, jump to definition