Plasma GitLab Archive
Projects Blog Knowledge

Module Netulex.Ulexing


module Ulexing: sig .. end


This is a lexing buffer for ulex.
type lexbuf 
exception Error
Lexical error
val from_ulb_lexbuf : Netulex.ULB.unicode_lexbuf -> lexbuf
Creates a new lexbuf from the unicode_lexbuf. After that, the unicode_lexbuf must no longer be modified.
val lexeme_start : lexbuf -> int
The character position of the start of the lexeme
val lexeme_end : lexbuf -> int
The character position of the end of the lexeme
val lexeme_length : lexbuf -> int
The length of the lexeme in characters
val lexeme : lexbuf -> int array
Returns the lexeme as array of Unicode code points
val lexeme_char : lexbuf -> int -> int
Returns the code point of a certain character of the lexeme
val sub_lexeme : lexbuf -> int -> int -> int array
Returns a substring of the lexeme as array of Unicode code points. The first int is the characater position where to start, the second int is the number of characters.
val utf8_lexeme : lexbuf -> string
Returns the lexeme as UTF-8 encoded string
val utf8_sub_lexeme : lexbuf -> int -> int -> string
Returns a substring of the lexeme as UTF-8 encoded string. The first int is the characater position where to start, the second int is the number of characters.
val utf8_sub_lexeme_length : lexbuf -> int -> int -> int
Same as String.length(utf8_sub_lexeme args), i.e. returns the number of bytes a certain sub lexeme will have when encoded as UTF-8 string.
This web site is published by Informatikbüro Gerd Stolpmann
Powered by Caml