lyx_mirror/lib/examples/noweb2lyx.lyx
Lars Gullik Bjønnes 27de1486ca Initial revision
git-svn-id: svn://svn.lyx.org/lyx/lyx-devel/trunk@140 a592a061-630c-0410-9148-cb99ea01b6c8
1999-09-27 18:44:28 +00:00


#This file was created by <kayvan> Sun May 2 15:56:35 1999
#LyX 1.0 (C) 1995-1999 Matthias Ettrich and the LyX Team
\lyxformat 2.15
\textclass literate-article
\begin_preamble
%
% This relaxes the noweb constraint that chunks are
% never broken across pages.
%
% This is from the noweb FAQ
%
\def\nwendcode{\endtrivlist \endgroup}
\let\nwdocspar=\smallbreak
\end_preamble
\language default
\inputencoding default
\fontscheme default
\graphics default
\paperfontsize default
\spacing single
\papersize Default
\paperpackage a4
\use_geometry 0
\use_amsmath 0
\paperorientation portrait
\secnumdepth 3
\tocdepth 3
\paragraph_separation indent
\defskip medskip
\quotes_language english
\quotes_times 2
\papercolumns 1
\papersides 1
\paperpagestyle default
\layout Title
\noun on
noweb2lyx
\layout Author
Kayvan A.
Sylvan <kayvan@sylvan.com>
\layout Date
May 6, 1999
\layout Abstract
This document describes and implements a perl script for importing noweb
files into LyX.
\layout Standard
\pagebreak_bottom
\begin_inset LatexCommand \tableofcontents{}
\end_inset
\layout Section
Introduction
\layout Standard
Since version 1.0.1, LyX has supported literate programming using
\noun on
noweb
\noun default
.
This addition to LyX made it very pleasant to write programs in the literate
style (like this one).
In addition to being able to write new literate programs, it would be quite
useful if old
\noun on
noweb
\noun default
code could be imported into LyX in some fashion.
That's where this program comes in.
\layout Standard
The purpose of
\noun on
noweb2lyx
\noun default
is to convert a
\noun on
noweb
\noun default
file to LyX.
\layout Scrap
<<noweb2lyx.in>>=
\newline
#!@PERL@
\newline
#
\newline
# Copyright (C) 1999 Kayvan A.
Sylvan <kayvan@sylvan.com>
\newline
#
\protected_separator
You are free to use and modify this code under the terms of
\newline
# the GNU General Public Licence version 2 or later.
\newline
#
\newline
#
\protected_separator
Written with assistance from:
\newline
#
\protected_separator
Edmar Wienskoski Jr.
<edmar-w-jr@technologist.com>
\newline
#
\protected_separator
Amir Karger <karger@post.harvard.edu>
\newline
#
\newline
# $Id: noweb2lyx.lyx,v 1.1 1999/09/27 18:44:32 larsbj Exp $
\newline
#
\newline
# NOTE: This file was automatically generated from noweb2lyx.lyx using noweb.
\newline
#
\newline
<<Setup variables from user supplied args>>
\newline
<<Subroutines>>
\newline
<<Convert noweb to LyX>>
\newline
@
\layout Section
The noweb file format
\layout Standard
A
\noun on
noweb
\noun default
file is a collection of documentation and code chunks.
Documentation chunks simply start with an ``@'' and have no name:
\layout LyX-Code
@ Here is some documentation.
\newline
We can do arbitrary LaTeX code here.
\newline
[...
blah blah blah ...]
\layout Standard
Code chunks look like this:
\layout LyX-Code
<<Name of chunk here>>=
\newline
{...
code for the chunk goes here ...}
\newline
@
\layout Standard
The ``@'' is a necessary delimiter to end the code chunk.
The other form that the ``@'' line takes is as follows:
\layout LyX-Code
<<Name of chunk here>>=
\newline
{...
code for the chunk ...}
\newline
@ %def identifier1 identifier2
\layout Standard
In the latter form, we are declaring to
\noun on
noweb
\noun default
that this code chunk defines identifier1, identifier2, etc.
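\layout Standard
As a rough illustration of the line forms described above, the following
 Perl sketch labels individual lines of a noweb file.
The subroutine name classify_line and its return labels are hypothetical;
 they are not part of noweb2lyx, which handles these cases inline.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of noweb2lyx): label one line of noweb input.
sub classify_line {
    my ($line) = @_;
    return "chunk-header" if $line =~ /^\s*<<.*>>=/;   # <<Name of chunk here>>=
    return "at-def"       if $line =~ /^\@\s+%def\s/;  # @ %def identifier1 ...
    return "at-bare"      if $line =~ /^\@\s*$/;       # @ alone on the line
    return "at-text"      if $line =~ /^\@\s+\S/;      # @ followed by documentation
    return "other";
}

for my $line ("<<Name of chunk here>>=\n",
              "\@ %def identifier1 identifier2\n",
              "\@\n",
              "\@ Here is some documentation.\n") {
    print classify_line($line), "\n";
}
```

\layout Standard
Whether a bare ``@'' ends a code chunk or starts a documentation chunk depends
 on what came before it, so a real parser also has to track its current
 state.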
\layout Standard
When first tackling this problem, I spoke with members of the LyX team who
knew about the literate programming extensions and reLyX (the LaTeX importing
code).
\layout Standard
One of the first ideas was to extend the reLyX code to understand the
\noun on
noweb
\noun default
code chunks.
This proved to be too hard and presented other problems
\begin_float footnote
\layout Standard
Not the least of these problems is that << is a quotation mark in French.
\end_float
.
On the other hand, it turns out that reLyX contains a very useful literal
quoting mechanism.
If the input file contains the construct
\layout LyX-Code
\backslash
begin{reLyXskip}
\newline
{...
LaTeX stuff ...}
\newline
\backslash
end{reLyXskip}
\layout Standard
then reLyX will copy the surrounded code to the output file verbatim.
Given this, the first part of the translation is easy; we simply have to
copy the code chunks into an intermediate file that surrounds them with
\family typewriter
\backslash
begin{reLyXskip}
\family default
and
\family typewriter
\backslash
end{reLyXskip}
\family default
.
\layout Standard
Once reLyX is done with the input file, the problem is reduced to changing
the code chunks from LyX's LaTeX layout to the Scrap layout.
\layout Standard
There is one final constraint on
\noun on
noweb2lyx
\noun default
.
We want to be able to run it as a simple pre-processor and post-processor
from within reLyX.
We can accomplish this by setting the flags
\latex latex
[[pre_only]]
\latex default
and
\latex latex
[[post_only]]
\latex default
before we reach the main conversion code.
\layout Standard
With all that preamble out of the way, we now have the basic high-level
outline for our code:
\layout Scrap
<<Convert noweb to LyX>>=
\newline
if (!$post_only) {
\newline
\protected_separator
\protected_separator
<<Transform noweb for reLyX>>
\newline
}
\newline
if ((!$pre_only) && (!$post_only)) {
\newline
\protected_separator
\protected_separator
<<Run reLyX on intermediate file>>
\newline
}
\newline
if (!$pre_only) {
\newline
\protected_separator
\protected_separator
<<Fix up LyX file>>
\newline
}
\newline
<<Clean up>>
\newline
@
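\layout Standard
To make the effect of the two flags concrete, here is a small sketch that
 lists which stages are reached for each combination.
The subroutine stages and the stage labels are assumptions made for this
 illustration only; the conditions mirror the scrap above.

```perl
use strict;
use warnings;

# Hypothetical helper: list the stages reached for a given pair of flags,
# using the same conditions as the scrap above.
sub stages {
    my ($pre_only, $post_only) = @_;
    my @run;
    push @run, "transform" unless $post_only;
    push @run, "relyx"     unless $pre_only || $post_only;
    push @run, "fixup"     unless $pre_only;
    push @run, "cleanup";                  # reached in every mode
    return @run;
}

print join(" ", stages(0, 0)), "\n";   # transform relyx fixup cleanup
print join(" ", stages(1, 0)), "\n";   # transform cleanup
print join(" ", stages(0, 1)), "\n";   # fixup cleanup
```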
\layout Section
Making a file that reLyX can process
\layout Standard
In this section, we present the code that creates the intermediate file
 that reLyX can process, following the approach just outlined.
The overall structure of this step is:
\layout Scrap
<<Transform noweb for reLyX>>=
\newline
<<Setup INPUT and OUTPUT>>
\newline
inputline: while(<INPUT>)
\newline
{
\newline
\protected_separator
\protected_separator
if (/^
\backslash
s*
\backslash
<
\backslash
<.*
\backslash
>
\backslash
>=/) { # Beginning of a noweb scrap
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
<<Read in and output the noweb code chunk>>
\newline
\protected_separator
\protected_separator
} elsif (/^@
\backslash
s+(.*)/) { # Beginning of a documentation chunk
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
print OUTPUT $1; # We do not need the ``@'' part
\newline
\protected_separator
\protected_separator
} elsif (/
\backslash
[
\backslash
[.+
\backslash
]
\backslash
]/) { # noweb quoted code
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
<<Perform special input quoting of [[var]]>>
\newline
\protected_separator
\protected_separator
} else {
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
print OUTPUT; # Just let the line pass through
\newline
\protected_separator
\protected_separator
}
\newline
}
\newline
<<Close INPUT and OUTPUT>>
\newline
@
\layout Standard
The code above pre-processes the noweb ``[[...]]'' construct by wrapping each
 occurrence in braces.
This avoids some problems where reLyX mishandles lists composed of ``[[...]]''
 constructs.
\layout Scrap
<<Perform special input quoting of [[var]]>>=
\newline
s/
\backslash
[
\backslash
[.+?
\backslash
]{2,}/{$&}/g;
\newline
print OUTPUT;
\newline
@
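\layout Standard
The substitution above can be tried in isolation.
The sketch below wraps it in a hypothetical subroutine, brace_quoted_code,
 that is not part of noweb2lyx, and shows why the pattern allows two or more
 closing brackets: quoted code may itself end in a bracket.

```perl
use strict;
use warnings;

# Hypothetical wrapper around the substitution used in the scrap above.
sub brace_quoted_code {
    my ($line) = @_;
    # Non-greedy body, then at least two closing brackets, so that
    # quoted code ending in "]" is captured whole.
    $line =~ s/\[\[.+?\]{2,}/{$&}/g;
    return $line;
}

print brace_quoted_code("Set [[pre_only]] or [[post_only]] first.\n");
print brace_quoted_code("Index with [[a[1]]] here.\n");
```

\layout Standard
The first call prints the line with each quoted name surrounded by braces;
 the second keeps all three closing brackets inside a single pair of braces.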
\layout Standard
While reading in the
\latex latex
[[INPUT]]
\latex default
file, once we have identified a
\noun on
noweb
\noun default
code chunk, we transform it into a form that is usable by reLyX.
\layout Scrap
<<Read in and output the noweb code chunk>>=
\newline
<<Save the beginning of the scrap to savedScrap>>
\newline
<<Concatenate the rest of the scrap>>
\newline
<<print out the scrap in a reLyXskip block>>
\newline
@
\layout Subsection
File input and output for the pre-processing step
\layout Standard
In
\noun on
noweb2lyx
\noun default
, we will use
\latex latex
[[INPUT]]
\latex default
and
\latex latex
[[OUTPUT]]
\latex default
to read and write files.
In the code fragment above, we need to read from the input file and write
to a file that will be later transformed by reLyX.
If we are being called only to pre-process the input file, then there is
no need to create a temporary file.
\layout Scrap
<<Setup INPUT and OUTPUT>>=
\newline
if ($pre_only) {
\newline
\protected_separator
\protected_separator
&setup_files($input_file, $output_file);
\newline
} else {
\newline
\protected_separator
\protected_separator
$relyx_file = "temp$$";
\newline
\protected_separator
\protected_separator
&setup_files($input_file, $relyx_file);
\newline
}
\newline
@
\layout Standard
This code uses a small perl subroutine,
\latex latex
[[setup_files]]
\latex default
, which we define below:
\layout Scrap
<<Subroutines>>=
\newline
sub setup_files {
\newline
\protected_separator
\protected_separator
my($in, $out) = @_;
\newline
\protected_separator
\protected_separator
open(INPUT, "<$in") || die "Can not read $in: $!
\backslash
n";
\newline
\protected_separator
\protected_separator
open(OUTPUT, ">$out") || die "Can not write $out: $!
\backslash
n";
\newline
}
\newline
@ %def setup_files
\protected_separator
\protected_separator
\layout Subsection
Reading in the
\noun on
noweb
\noun default
scrap
\layout Standard
After we see the beginning of the scrap, we need to read in and save the
rest of the scrap for output.
\layout Scrap
<<Save the beginning of the scrap to savedScrap>>=
\newline
$savedScrap = $_;
\newline
$endLine = "";
\newline
@
\layout Scrap
<<Concatenate the rest of the scrap>>=
\newline
scrapline: while (<INPUT>) {
\newline
\protected_separator
\protected_separator
last scrapline if /^@
\backslash
s+/;
\newline
\protected_separator
\protected_separator
$savedScrap .= $_;
\newline
};
\newline
switch: {
\newline
\protected_separator
\protected_separator
if (/^@
\backslash
s+$/) {$savedScrap .= $_; last switch; }
\newline
\protected_separator
\protected_separator
if (/^@
\backslash
s+%def.*$/) {$savedScrap .= $_; last switch; }
\newline
\protected_separator
\protected_separator
if (/^@
\backslash
s+(.*)$/) {$savedScrap .= "@
\backslash
n"; $endLine = "$1
\backslash
n"; }
\newline
}
\newline
@
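\layout Standard
The three cases in the switch block can be summarized by a small function.
This is only a sketch: split_terminator is a hypothetical name, and the
 real script keeps this logic inline and stores the results in savedScrap
 and endLine.

```perl
use strict;
use warnings;

# Hypothetical helper mirroring the switch block above. Given the line that
# ended a scrap, return (text appended to the scrap, text emitted after it).
sub split_terminator {
    my ($line) = @_;
    return ($line, "") if $line =~ /^\@\s+$/;            # bare "@"
    return ($line, "") if $line =~ /^\@\s+%def.*$/;      # "@ %def ..."
    return ("\@\n", "$1\n") if $line =~ /^\@\s+(.*)$/;   # "@ text": split it
    return ("", "");                                     # not a terminator
}

my ($scrap_part, $after) = split_terminator("\@ Documentation resumes.\n");
print "scrap gets: $scrap_part";
print "emitted after the scrap: $after";
```

\layout Standard
The third case is the interesting one: documentation text that shares a
 line with the closing ``@'' is moved out of the scrap so that it ends up after
 the reLyXskip block.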
\layout Subsection
Printing out the scrap
\layout Standard
The final piece of the first pass of the conversion is done by this code.
\layout Scrap
<<print out the scrap in a reLyXskip block>>=
\newline
print OUTPUT "
\backslash
\backslash
begin{reLyXskip}
\backslash
n";
\newline
print OUTPUT $savedScrap;
\newline
print OUTPUT "
\backslash
\backslash
end{reLyXskip}
\backslash
n
\backslash
n";
\newline
print OUTPUT "$endLine";
\newline
@
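\layout Standard
For reference, here is a sketch of what this step produces for one scrap.
The subroutine wrap_scrap is hypothetical; it returns the text that the
 print statements above would write to OUTPUT.

```perl
use strict;
use warnings;

# Hypothetical helper: build the text written for one scrap, given the saved
# scrap and any documentation text that followed the closing "@".
sub wrap_scrap {
    my ($scrap, $end_line) = @_;
    return "\\begin{reLyXskip}\n"
         . $scrap
         . "\\end{reLyXskip}\n\n"
         . $end_line;
}

print wrap_scrap("<<Clean up>>=\nunlink \$tmp;\n\@\n", "More documentation.\n");
```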
\layout Standard
Finally, we need to close the
\latex latex
[[INPUT]]
\latex default
and
\latex latex
[[OUTPUT]]
\latex default
files.
\layout Scrap
<<Close INPUT and OUTPUT>>=
\newline
close(INPUT);
\newline
close(OUTPUT);
\newline
@
\layout Section
Running reLyX
\layout Standard
In this section, we describe and implement the code that runs reLyX on the
intermediate file
\latex latex
[[relyx_file]]
\latex default
.
\layout Subsection
Selecting the document class
\layout Standard
In order to run reLyX, we need to know the document class of the input document
(to choose the corresponding literate document layout).
For this, we need to parse the intermediate file.
\layout Scrap
<<Run reLyX on intermediate file>>=
\newline
<<Parse for document class>>
\newline
<<Run reLyX with document class>>
\newline
@
\layout Standard
In the code below, you'll see a strange regular expression to search for
the document class.
The reason for this kludge is that without it, we can't run
\noun on
noweb2lyx
\noun default
on the
\emph on
noweb2lyx.nw
\emph default
file that is generated by LyX
\begin_float footnote
\layout Standard
reLyX searches for
\backslash
\backslash
doc
\latex latex
{}
\latex default
ument
\latex latex
{}
\latex default
class and gets confused, so we have to obfuscate it slightly.
\end_float
.
With the regular expression as it is, we can actually run
\noun on
noweb2lyx
\noun default
on itself and produce a quite reasonable LyX file.
\layout Scrap
<<Parse for document class>>=
\newline
open(INPUT, "<$relyx_file") ||
\newline
\protected_separator
\protected_separator
die "Can not read $relyx_file: $!
\backslash
n";
\newline
$class = "article"; # default if none found
\newline
parse: while(<INPUT>) {
\newline
\protected_separator
\protected_separator
if (/
\backslash
\backslash
docu[m]entclass{(.*)}/) {
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
$class = $1;
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
last parse;
\newline
\protected_separator
\protected_separator
}
\newline
}
\newline
close(INPUT);
\newline
@
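\layout Standard
The class lookup can be exercised on its own.
In this sketch, find_class is a hypothetical subroutine that takes lines
 from memory rather than from a file, and the braces in the pattern are
 escaped, which is an adjustment made here for newer perl versions rather
 than something the original scrap does.

```perl
use strict;
use warnings;

# Hypothetical helper: return the document class named in a list of lines,
# or "article" when none is found (the same default as the scrap above).
sub find_class {
    my @lines = @_;
    for my $line (@lines) {
        # [m] matches a plain "m" but keeps the literal command name
        # out of this source text.
        return $1 if $line =~ /\\docu[m]entclass\{(.*)\}/;
    }
    return "article";
}

# The command name is split in two so that it does not appear literally here.
my @sample = ("% header\n", "\\docu" . "mentclass{book}\n");
print "literate-", find_class(@sample), "\n";   # prints "literate-book"
```

\layout Standard
Like the original, this pattern only matches when the opening brace follows
 the command name directly.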
\layout Subsection
Running reLyX with the corresponding literate document layout
\layout Standard
Now that we know what the document class ought to be, we do:
\layout Scrap
<<Run reLyX with document class>>=
\newline
$doc_class = "literate-" .
$class;
\newline
die "reLyX returned non-zero: $?
\backslash
n"
\newline
\protected_separator
\protected_separator
if (system("reLyX -c $doc_class $relyx_file"));
\newline
@
\layout Standard
reLyX performs the bulk of the translation work.
Note that if the ``literate-
\emph on
class
\emph default
'' document layout is not found, then reLyX will fail with an error.
In that case, you may need to modify your
\noun on
noweb
\noun default
input file to a supported document type.
\layout Section
Fixing the reLyX output
\layout Standard
We need to perform some post-processing of what reLyX produces in order
to have the best output for our literate document.
The outline of the post-processing steps is:
\layout Scrap
<<Fix up LyX file>>=
\newline
<<Setup INPUT and OUTPUT for the final output>>
\newline
line: while(<INPUT>)
\newline
{
\newline
\protected_separator
\protected_separator
<<Fix code chunks in latex layout>>
\newline
\protected_separator
\protected_separator
<<Fix [[var]] noweb construct>>
\newline
\protected_separator
\protected_separator
print OUTPUT; # default
\newline
}
\newline
<<Close INPUT and OUTPUT>>
\newline
@
\layout Standard
Note that in the perl code that is contained in the
\latex latex
[[while(<INPUT>)]]
\latex default
loop above, the perl construct
\latex latex
[[next line]]
\latex default
is sufficient to restart the loop.
We can use this construct to do some relatively complex parsing of the
reLyX generated file.
\layout Subsection
File input and output for the post-processing
\layout Standard
Setting up the
\latex latex
[[INPUT]]
\latex default
and
\latex latex
[[OUTPUT]]
\latex default
is taken care of by this code:
\layout Scrap
<<Setup INPUT and OUTPUT for the final output>>=
\newline
if ($post_only) {
\newline
\protected_separator
\protected_separator
&setup_files("$input_file", "$output_file");
\newline
} else {
\newline
\protected_separator
&setup_files("$relyx_file.lyx", "$output_file");
\newline
}
\newline
@
\layout Subsection
Making sure the code chunks are in the Scrap layout
\layout Standard
Now, as we outlined before, the final step is to transform the code chunks,
 which reLyX has put into a LaTeX layout, into the Scrap layout.
\layout Scrap
<<Fix code chunks in latex layout>>=
\newline
if (/
\backslash
\backslash
latex latex/) { # Beginning of some latex code
\newline
\protected_separator
\protected_separator
if (($line = <INPUT>) =~ /^
\backslash
s*<</) { # code scrap
\newline
\protected_separator
\protected_separator
\protected_separator
<<Transform this chunk into layout scrap>>
\newline
\protected_separator
\protected_separator
} else {
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
# print the
\backslash
latex latex line + next line
\newline
\protected_separator
\protected_separator
\protected_separator
\protected_separator
print OUTPUT "$_$line";
\newline
\protected_separator
\protected_separator
}
\newline
\protected_separator
\protected_separator
next line;
\newline
}
\newline
@
\layout Standard
When we are sure that we are in a code chunk, we must read in the rest of
the code chunk and output a scrap layout for it:
\layout Scrap
<<Transform this chunk into layout scrap>>=
\newline
$savedScrap = "
\backslash
\backslash
layout Scrap
\backslash
n
\backslash
n$line";
\newline
codeline: while (<INPUT>) {
\newline
\protected_separator
\protected_separator
$savedScrap .= $_;
\newline
\protected_separator
\protected_separator
last codeline if /^@
\backslash
s+/;
\newline
};
\newline
print OUTPUT $savedScrap;
\newline
<<Slurp up to the end of the latex layout>>
\newline
@
\layout Standard
Okay, now we just need to eat the rest of the latex layout.
There should only be a few different types of lines for us to match:
\layout Scrap
<<Slurp up to the end of the latex layout>>=
\newline
slurp: while (<INPUT>) {
\newline
\protected_separator
\protected_separator
last slurp if /
\backslash
\backslash
latex /;
\newline
\protected_separator
\protected_separator
next slurp if /
\backslash
\backslash
newline/;
\newline
\protected_separator
\protected_separator
next slurp if /^
\backslash
s*$/;
\newline
\protected_separator
\protected_separator
warn "confused by line: $_";
\newline
}
\newline
@
\layout Subsection
Taking care of the
\noun on
noweb
\noun default
\emph on
[[quoted code]]
\emph default
construct
\layout Standard
\noun on
noweb
\noun default
allows the user to use a special code quoting mechanism in documentation
chunks.
Fixing this ``[[quoted-code]]''
\noun on
noweb
\noun default
syntax means putting the ``[[quoted-code]]'' in a LaTeX layout in the LyX
file.
Otherwise, LyX will backslash-quote the brackets, creating ugly output.
The quoted-code is transformed by
\noun on
noweb
\noun default
when it generates the final LaTeX code.
\layout Scrap
<<Fix [[var]] noweb construct>>=
\newline
if (/
\backslash
[
\backslash
[.+
\backslash
]
\backslash
]/) { # special code for [[var]]
\newline
\protected_separator
\protected_separator
s/
\backslash
[
\backslash
[.+?
\backslash
]{2,}/
\backslash
n
\backslash
\backslash
latex latex
\backslash
n$&
\backslash
n
\backslash
\backslash
latex default
\backslash
n/g;
\newline
\protected_separator
\protected_separator
print OUTPUT;
\newline
\protected_separator
\protected_separator
next line;
\newline
}
\newline
@
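\layout Standard
The effect of this substitution on a single line of LyX text can be seen
 with the sketch below.
The subroutine mark_quoted_code is hypothetical and simply wraps the same
 substitution so that it can be called on a string.

```perl
use strict;
use warnings;

# Hypothetical wrapper around the substitution in the scrap above: put each
# [[...]] construct on its own line between LyX's LaTeX-mode markers.
sub mark_quoted_code {
    my ($line) = @_;
    $line =~ s/\[\[.+?\]{2,}/\n\\latex latex\n$&\n\\latex default\n/g;
    return $line;
}

print mark_quoted_code("Setting [[pre_only]] skips the later stages.\n");
```

\layout Standard
The printed result spans five lines: the text before the construct, the
 marker that switches LaTeX mode on, the construct itself, the marker that
 switches back to the default, and the remaining text.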
\layout Section
Cleaning up intermediate files
\layout Standard
The cleanup code is very simple:
\layout Scrap
<<Clean up>>=
\newline
system("rm -f $relyx_file*") unless ($post_only || $pre_only);
\newline
@
\layout Section
User supplied arguments
\layout Standard
The
\noun on
noweb2lyx
\noun default
script takes two arguments, input-file and output-file.
It is also set up to be used internally by reLyX to pre-process or post-process
 files in the import pipeline.
\layout Scrap
<<Setup variables from user supplied args>>=
\newline
&usage() if ($#ARGV < 1); # zero or one argument
\newline
if ($ARGV[0] eq "-pre") {
\newline
\protected_separator
\protected_separator
&usage unless ($#ARGV == 2);
\newline
\protected_separator
\protected_separator
$input_file = $ARGV[1]; $output_file = $ARGV[2]; $pre_only = 1;
\newline
} elsif ($ARGV[0] eq "-post") {
\newline
\protected_separator
\protected_separator
&usage unless ($#ARGV == 2);
\newline
\protected_separator
\protected_separator
$input_file = $ARGV[1]; $output_file = $ARGV[2]; $post_only = 1;
\newline
} else {
\newline
\protected_separator
\protected_separator
&usage unless ($#ARGV == 1);
\newline
\protected_separator
\protected_separator
$input_file = $ARGV[0];
\protected_separator
$output_file = $ARGV[1];
\newline
\protected_separator
\protected_separator
$pre_only = 0; $post_only = 0;
\newline
}
\newline
@ %def input_file output_file pre_only post_only
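\layout Standard
The same argument rules can be written as a function without side effects,
 which makes them easy to check.
This is a sketch only: parse_args and the hash it returns are assumptions
 made for this illustration, whereas the script itself sets global variables
 and calls usage on error.

```perl
use strict;
use warnings;

# Hypothetical variant of the argument handling above: returns a hash of
# settings, or an empty list when the arguments are invalid.
sub parse_args {
    my @args = @_;
    return () if @args < 2;                  # zero or one argument
    if ($args[0] eq "-pre" || $args[0] eq "-post") {
        return () unless @args == 3;         # flag plus two file names
        return (input_file  => $args[1],
                output_file => $args[2],
                pre_only    => ($args[0] eq "-pre")  ? 1 : 0,
                post_only   => ($args[0] eq "-post") ? 1 : 0);
    }
    return () unless @args == 2;             # exactly two file names
    return (input_file => $args[0], output_file => $args[1],
            pre_only => 0, post_only => 0);
}

my %opt = parse_args("-pre", "doc.nw", "doc.tex");
print "pre_only=$opt{pre_only} post_only=$opt{post_only}\n";
```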
\layout Scrap
<<Subroutines>>=
\newline
sub usage() {
\newline
\protected_separator
\protected_separator
print "Usage: noweb2lyx [-pre | -post] input-file output-file
\newline
\newline
If -pre is specified, only pre-processes the input-file for reLyX.
\newline
Similarly, in the case of -post, post-processes reLyX output.
\newline
In case of bugs, Email Kayvan Sylvan <kayvan
\backslash
@sylvan.com>.
\backslash
n";
\newline
\protected_separator
\protected_separator
exit;
\newline
}
\newline
@ %def usage
\layout Section
Generating the
\noun on
noweb2lyx
\noun default
script
\layout Standard
The noweb2lyx script can be tangled from LyX if you set
\family typewriter
\backslash
build_command
\family default
to call a generic script that always extracts a scrap named
\family typewriter
build-script
\family default
and executes it.
Here is an example of such a script:
\layout LyX-Code
#!/bin/sh
\newline
notangle -Rbuild-script $1 | sh
\layout Scrap
<<build-script>>=
\newline
PREFIX=/usr
\newline
notangle -Rnoweb2lyx.in noweb2lyx.nw > noweb2lyx.in
\newline
sed -e "s=@PERL@=$PREFIX/bin/perl=" noweb2lyx.in > noweb2lyx
\newline
chmod +x noweb2lyx
\newline
@
\layout Section*
\pagebreak_top
Macros
\layout Standard
\latex latex
\backslash
nowebchunks
\layout Section*
Identifiers
\layout Standard
\latex latex
\backslash
nowebindex
\the_end