.title ASEAWK
== ASEAWK ==
ASE provides an embeddable processor of a dialect of the AWK programming language. The language implemented is slightly different from {the version developed by Brian W. Kernighan, http://cm.bell-labs.com/cm/cs/awkbook/index.html} and has been adjusted to the author's preference.
=== Overview ===
The following code fragment illustrates the basic steps of embedding the processor.
{{{
1) #include <ase/awk/awk.h>

2) ase_awk_t* awk;
3) awk = ase_awk_open (...);
4) if (ase_awk_parse (awk, ...) == -1)
   {
       /* parse error */
   }
   else
   {
5)     if (ase_awk_run (awk, ...) == -1)
       {
           /* run-time error */
       }
   }
6) ase_awk_close (awk);
}}}
(((
* Most of the functions and data types needed are defined in the header file ##ase/awk/awk.h##.
* ##ase_awk_t## represents the processor. However, the internal representation is not exposed.
* ##ase_awk_open## creates the processor instance.
* ##ase_awk_parse## parses an AWK script.
* ##ase_awk_run## executes the parsed script.
* ##ase_awk_close## destroys the processor instance.
)))
An embedding example is available in ##ase/test/awk/awk.c##. Refer to the sample code along with this document for more information.
=== Primitive Functions ===
A set of primitive functions is needed to create an instance of the AWK processor. A primitive function is a user-defined function that helps the library perform system-dependent operations such as memory allocation and character class handling.
{{{
typedef struct ase_awk_prmfns_t ase_awk_prmfns_t;

struct ase_awk_prmfns_t
{
    ase_mmgr_t mmgr;
    ase_ccls_t ccls;

    struct
    {
        ase_awk_pow_t     pow;
        ase_awk_sprintf_t sprintf;
        ase_awk_dprintf_t dprintf;
        void*             custom_data;
    } misc;
};
}}}
A caller of ##ase_awk_open## should fill in most of the fields of an ##ase_awk_prmfns_t## structure and pass the structure to it. The function pointers in the miscellaneous group labeled [misc] are defined as follows:
{{{
/* returns the value of x raised to the power of y */
typedef ase_real_t (*ase_awk_pow_t) (void* custom, ase_real_t x, ase_real_t y);

/* similar to snprintf of the standard C library. */
typedef int (*ase_awk_sprintf_t) (
    void* custom, ase_char_t* buf, ase_size_t size,
    const ase_char_t* fmt, ...);

/* similar to printf of the standard C library. called by a few uncommonly
 * used output functions usually for debugging purpose */
typedef void (*ase_awk_dprintf_t) (
    void* custom, const ase_char_t* fmt, ...);
}}}
The fourth field of the group, ##custom_data##, is passed to the member functions as the first argument on invocation. The function pointed to by the ##sprintf## field should ensure that the resulting string is null-terminated and that ##%s## and ##%c## are accepted for the ##ase_char_t*## and ##ase_char_t## types respectively, regardless of the character mode.

The memory manager group labeled [mmgr] and the character class group labeled [ccls] are defined as follows:
{{{
typedef void* (*ase_malloc_t) (void* custom, ase_size_t n);
typedef void* (*ase_realloc_t) (void* custom, void* ptr, ase_size_t n);
typedef void (*ase_free_t) (void* custom, void* ptr);

typedef ase_bool_t (*ase_isccls_t) (void* custom, ase_cint_t c);
typedef ase_cint_t (*ase_toccls_t) (void* custom, ase_cint_t c);

struct ase_mmgr_t
{
    ase_malloc_t  malloc;
    ase_realloc_t realloc;
    ase_free_t    free;
    void*         custom_data;
};

struct ase_ccls_t
{
    ase_isccls_t is_upper;
    ase_isccls_t is_lower;
    ase_isccls_t is_alpha;
    ase_isccls_t is_digit;
    ase_isccls_t is_xdigit;
    ase_isccls_t is_alnum;
    ase_isccls_t is_space;
    ase_isccls_t is_print;
    ase_isccls_t is_graph;
    ase_isccls_t is_cntrl;
    ase_isccls_t is_punct;
    ase_toccls_t to_upper;
    ase_toccls_t to_lower;
    void*        custom_data;
};
}}}
The functions in these groups perform memory operations and character class operations respectively. They follow the style of the memory allocation and character class handling functions of the standard C library, except that they accept a pointer to user-defined data as the first argument, which provides more flexibility. The pointer to the user-defined data is specified in the ##custom_data## field of each group. The ##realloc## field, however, can be ##ASE_NULL##, in which case the functions pointed to by the ##free## and ##malloc## fields take over the role of the function pointed to by the ##realloc## field.
=== Source IO Handler ===
The source code is read through a source input handler provided by the user. An optional source output handler can be provided to have the internal parse tree converted back to source code.

The source handler type and the structure that groups the handlers are defined as follows:
{{{
typedef ase_ssize_t (*ase_awk_io_t) (
    int cmd, void* custom, ase_char_t* data, ase_size_t count);

typedef struct ase_awk_srcios_t ase_awk_srcios_t;

struct ase_awk_srcios_t
{
    ase_awk_io_t in;  /* source input */
    ase_awk_io_t out; /* source output */
    void*        custom_data;
};
}}}
The ##in## field of ##ase_awk_srcios_t## is mandatory and should be filled in. The ##out## field can be set to ##ASE_NULL## or can point to a source output handling function. The ##custom_data## field is passed to the source handlers as the second argument.
The first parameter ##cmd## of the source input handler is one of the following values:
{{{
enum
{
    ASE_AWK_IO_OPEN  = 0,
    ASE_AWK_IO_CLOSE = 1,
    ASE_AWK_IO_READ  = 2
};
}}}
The first parameter ##cmd## of the source output handler is one of the following values:
{{{
enum
{
    ASE_AWK_IO_OPEN  = 0,
    ASE_AWK_IO_CLOSE = 1,
    ASE_AWK_IO_WRITE = 3
};
}}}
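The third parameter ##data## and the fourth parameter ##count## are used if ##cmd## is either ##ASE_AWK_IO_READ## or ##ASE_AWK_IO_WRITE##. The following sample handlers read the source from a file and write the converted source to the standard output. In these samples, ##struct awk_src_io## is a user-defined structure received through the second argument.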
{{{
static ase_ssize_t awk_srcio_in (
    int cmd, void* arg, ase_char_t* data, ase_size_t size)
{
    struct awk_src_io* src_io = (struct awk_src_io*)arg;
    ase_cint_t c;

    if (cmd == ASE_AWK_IO_OPEN)
    {
        /* no input file has been set: there is nothing to open */
        if (src_io->input_file == ASE_NULL) return 0;
        src_io->input_handle = ase_fopen (src_io->input_file, ASE_T("r"));
        if (src_io->input_handle == ASE_NULL) return -1;
        return 1;
    }
    else if (cmd == ASE_AWK_IO_CLOSE)
    {
        if (src_io->input_file == ASE_NULL) return 0;
        fclose ((FILE*)src_io->input_handle);
        return 0;
    }
    else if (cmd == ASE_AWK_IO_READ)
    {
        /* reads a single character per call; 0 signals the end of input */
        if (size <= 0) return -1;
        c = ase_fgetc ((FILE*)src_io->input_handle);
        if (c == ASE_CHAR_EOF) return 0;
        *data = (ase_char_t)c;
        return 1;
    }

    return -1;
}
}}}
{{{
static ase_ssize_t awk_srcio_out (
    int cmd, void* arg, ase_char_t* data, ase_size_t size)
{
    /*struct awk_src_io* src_io = (struct awk_src_io*)arg;*/

    if (cmd == ASE_AWK_IO_OPEN) return 1;
    else if (cmd == ASE_AWK_IO_CLOSE)
    {
        fflush (stdout);
        return 0;
    }
    else if (cmd == ASE_AWK_IO_WRITE)
    {
        /* the precision argument for %.*s must be an int */
        int n = ase_fprintf (stdout, ASE_T("%.*s"), (int)size, data);
        if (n < 0) return -1;
        return (ase_ssize_t)size;
    }

    return -1;
}
}}}
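
A source input handler does not have to read from a file. As a further illustration, the sketch below feeds the parser from a string held in memory. It is not part of the ASE distribution: the typedefs and the ##enum## are stand-ins so that the fragment is self-contained, and ##struct str_src## and ##str_srcio_in## are hypothetical names. The return values (1 for a successful open, 0 when there is no source or no more input, -1 on error, otherwise the number of characters read) are inferred from the sample handlers above rather than from a formal specification.

```c
#include <assert.h>
#include <stddef.h>

/* Stand-in definitions (assumption); the real ones come from the ASE headers. */
typedef char      ase_char_t;
typedef size_t    ase_size_t;
typedef ptrdiff_t ase_ssize_t;

enum
{
    ASE_AWK_IO_OPEN  = 0,
    ASE_AWK_IO_CLOSE = 1,
    ASE_AWK_IO_READ  = 2
};

/* Hypothetical user data: the script text and the current read position. */
struct str_src
{
    const ase_char_t* text;
    ase_size_t        pos;
};

static ase_ssize_t str_srcio_in (
    int cmd, void* arg, ase_char_t* data, ase_size_t size)
{
    struct str_src* src = (struct str_src*)arg;
    ase_size_t n = 0;

    assert (src != NULL);

    if (cmd == ASE_AWK_IO_OPEN)
    {
        if (src->text == NULL) return 0; /* no source to read */
        src->pos = 0;
        return 1;
    }
    else if (cmd == ASE_AWK_IO_CLOSE)
    {
        return 0; /* nothing to release */
    }
    else if (cmd == ASE_AWK_IO_READ)
    {
        if (src->text == NULL || data == NULL || size == 0) return -1;

        /* copy up to size characters; 0 is returned at the end of the text */
        while (n < size && src->text[src->pos] != '\0')
        {
            data[n++] = src->text[src->pos++];
        }
        return (ase_ssize_t)n;
    }

    return -1; /* unknown command */
}
```

To use such a handler, the caller would store ##str_srcio_in## in the ##in## field and the address of a ##struct str_src## in the ##custom_data## field of ##ase_awk_srcios_t##. Unlike the file-based sample, this version returns as many characters per call as the buffer allows.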
=== External IO Handler ===
External IO handlers should be provided to support AWK's built-in IO facilities.