NaturalDocs::Languages::Advanced

NaturalDocs::Languages::Base

NaturalDocs::Languages::Advanced

NaturalDocs::Languages::Perl

The base class for all languages that have full support in Natural Docs. Each one will have a custom parser capable of documenting undocumented aspects of the code.

Summary

NaturalDocs::Languages::Advanced	The base class for all languages that have full support in Natural Docs.
Implementation
Members	The class is implemented as a blessed arrayref.
Functions
New	Returns a new language object and adds it to NaturalDocs::Languages.
Tokens	Returns the tokens found by ParseForCommentsAndTokens().
SetTokens	Replaces the tokens.
ClearTokens	Resets the token list.
AutoTopics	Returns the arrayref of automatically generated topics, or undef if none.
AddAutoTopic	Adds a NaturalDocs::Parser::ParsedTopic to AutoTopics().
ClearAutoTopics	Resets the automatic topic list.
ScopeRecord	Returns an arrayref of NaturalDocs::Languages::Advanced::ScopeChange objects describing how and when the scope changed throughout the file.
Parsing Functions	These functions are good general language building blocks.
ParseForCommentsAndTokens	Loads the passed file, sends all appropriate comments to NaturalDocs::Parser->OnComment(), and breaks the rest into an arrayref of tokens.
TokenizeLine	Converts the passed line to tokens as described in ParseForCommentsAndTokens and adds them to Tokens().
TryToSkipString	If the position is on a string delimiter, moves the position to the token following the closing delimiter, or past the end of the tokens if there is none.
SkipRestOfLine	Moves the position to the token following the next line break, or past the end of the tokens array if there is none.
SkipUntilAfter	Moves the position to the token following the next occurrence of a particular token sequence, or past the end of the tokens array if it never occurs.
IsFirstLineToken	Returns whether the position is at the first token of a line, not including whitespace.
IsLastLineToken	Returns whether the position is at the last token of a line, not including whitespace.
IsAtSequence	Returns whether the position is at a sequence of tokens.
IsBackslashed	Returns whether the position is after a backslash.
Scope Functions	These functions provide a nice scope stack implementation for language-specific parsers to use.
ClearScopeStack	Clears the scope stack for a new file.
StartScope	Records a new scope level.
EndScope	Records the end of the current scope level.
ScopeSymbol	Returns the symbol that ends the current scope level, or undef if we are at the top level.
CurrentScope	Returns the current calculated scope, or undef if global.
CurrentNamespace	Returns the current calculated namespace, or undef if none.
CurrentPackage	Returns the current calculated package or class, or undef if none.
CurrentProtection	Returns the current protection, or undef if none.
SetNamespace	Sets the namespace for the current scope level.
SetPackage	Sets the package or class for the current scope level.
SetProtection	Sets the protection for the current scope level.
Support Functions
AddToScopeRecord	Adds a change to the scope record, condensing unnecessary entries.
CreateString	Converts the specified tokens into a string and returns it.

Members

The class is implemented as a blessed arrayref. The following constants are used as indexes.

TOKENS	An arrayref of tokens used in all the Parsing Functions.
SCOPE_STACK	An arrayref of NaturalDocs::Languages::Advanced::Scope objects serving as a scope stack for parsing. There will always be one available, with a symbol of undef, for the top level.
SCOPE_RECORD	An arrayref of NaturalDocs::Languages::Advanced::ScopeChange objects, as generated by the scope stack. If there is more than one change per line, only the last is stored.
AUTO_TOPICS	An arrayref of NaturalDocs::Parser::ParsedTopics generated automatically from the code.
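
The blessed arrayref layout described above can be sketched as follows. This is a minimal illustration only: the package name My::Language, the numeric values of the index constants, and the bodies of the accessors are assumptions, not the module's actual code.

```perl
use strict;
use warnings;

package My::Language;    # hypothetical class name, for illustration only

# Index constants into the object's underlying array.
# The numeric values are assumed; only the names come from the documentation.
use constant TOKENS       => 0;
use constant SCOPE_STACK  => 1;
use constant SCOPE_RECORD => 2;
use constant AUTO_TOPICS  => 3;

sub New {
    my ($class) = @_;

    my $object = [];
    $object->[TOKENS]       = [];
    $object->[SCOPE_STACK]  = [];
    $object->[SCOPE_RECORD] = [];
    $object->[AUTO_TOPICS]  = undef;    # undef means "no automatic topics"

    return bless $object, $class;
}

# Accessors index the array with the constants instead of using hash keys.
sub Tokens      { return $_[0]->[TOKENS]; }
sub SetTokens   { my ($self, $tokens) = @_; $self->[TOKENS] = $tokens; }
sub ClearTokens { $_[0]->[TOKENS] = []; }

package main;

my $language = My::Language->New();
$language->SetTokens([ 'my', ' ', '$', 'x', ';', "\n" ]);
print scalar(@{ $language->Tokens() }), " tokens\n";    # prints "6 tokens"
```

An array indexed by constants is a common Perl idiom for objects with a fixed set of members, since a misspelled constant fails at compile time under strict, whereas a misspelled hash key does not.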

ParseForCommentsAndTokens

sub ParseForCommentsAndTokens #(sourceFile, lineCommentSymbols, openingCommentSymbols, closingCommentSymbols)

Loads the passed file, sends all appropriate comments to NaturalDocs::Parser->OnComment(), and breaks the rest into an arrayref of tokens. Tokens are defined as one of the following:

All consecutive alphanumeric and underscore characters.
All consecutive whitespace.
A single line break. It will always be “\n”; you don’t have to worry about platform differences.
A single character not included above, which is usually a symbol. Multiple consecutive ones each get their own token.

The result will be placed in Tokens().
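
The four token rules above can be expressed as a single regular expression. The sketch below is one possible reading of those rules, not the module's own tokenizer; tokenize_line is a hypothetical helper name, and the line-ending normalization is an assumption about how the guaranteed "\n" might be produced.

```perl
use strict;
use warnings;

# Hypothetical helper, not the module's TokenizeLine().
# Splits one line of source text into tokens using the four rules.
sub tokenize_line {
    my ($line) = @_;

    # Normalize platform-specific line endings so a line break is always "\n".
    $line =~ s/\r\n|\r/\n/g;

    # In order: a run of word characters, a run of spaces or tabs,
    # a single line break, or any other single character.
    my @tokens = $line =~ /(\w+|[ \t]+|\n|[^\w \t\n])/g;

    return \@tokens;
}

my $tokens = tokenize_line("my \$count += 10;\r\n");

# Show the line break visibly when printing.
print join('|', map { $_ eq "\n" ? '\n' : $_ } @$tokens), "\n";
# prints: my| |$|count| |+|=| |10|;|\n
```

Note how "+=" becomes two tokens, since each symbol character gets a token of its own.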

Parameters

sourceFile	The source file to load and parse.
lineCommentSymbols	An arrayref of symbols that designate line comments, or undef if none.
openingCommentSymbols	An arrayref of symbols that designate the start of multiline comments, or undef if none.
closingCommentSymbols	An arrayref of symbols that designate the end of multiline comments, or undef if none.

Notes

This function automatically calls ClearAutoTopics() and ClearScopeStack(). You only need to call those functions manually if you override this one.
To save parsing time, all comment lines sent to NaturalDocs::Parser->OnComment() will be replaced with blank lines in Tokens(). It’s all the same to most languages.
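
The effect of replacing comment lines with blank lines can be illustrated with a small standalone sketch. It assumes that only comments occupying a whole line are extracted, which the text above does not spell out; split_comments is a hypothetical helper and does not call NaturalDocs::Parser.

```perl
use strict;
use warnings;

# Hypothetical helper: separates whole-line comments from code lines.
sub split_comments {
    my ($lines, $lineCommentSymbols) = @_;
    my (@comments, @codeLines);

    foreach my $line (@$lines) {
        my $isComment = 0;

        foreach my $symbol (@{ $lineCommentSymbols || [] }) {
            # The line counts as a comment only if the symbol comes first.
            if ($line =~ /^[ \t]*\Q$symbol\E(.*)$/) {
                push @comments, $1;
                $isComment = 1;
                last;
            }
        }

        # A blank placeholder keeps later line numbers unchanged.
        push @codeLines, ($isComment ? '' : $line);
    }

    return (\@comments, \@codeLines);
}

my ($comments, $codeLines) = split_comments(
    [ '# Function: Add', 'sub Add {', '    return $_[0] + $_[1];  # inline', '}' ],
    [ '#' ],
);

print scalar(@$comments), " comment line(s), ", scalar(@$codeLines), " code line(s)\n";
# prints: 1 comment line(s), 4 code line(s)
```

Because the placeholder keeps the line count intact, line numbers computed from the remaining code lines still match the original file.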

TryToSkipString

sub TryToSkipString #(indexRef, lineNumberRef, openingDelimiter, closingDelimiter, startContentIndexRef, endContentIndexRef)

If the position is on a string delimiter, moves the position to the token following the closing delimiter, or past the end of the tokens if there is none. Assumes all other characters are allowed in the string, the delimiter itself is allowed if it’s preceded by a backslash, and line breaks are allowed in the string.

Parameters

indexRef	A reference to the position’s index into Tokens().
lineNumberRef	A reference to the position’s line number.
openingDelimiter	The opening string delimiter, such as a quote or an apostrophe.
closingDelimiter	The closing string delimiter, if different. If not defined, assumes the same as openingDelimiter.
startContentIndexRef	A reference to a variable in which to store the index of the first token of the string’s content. May be undef.
endContentIndexRef	A reference to a variable in which to store the index of the end of the string’s content, which is one past the last index of content. May be undef.

Returns

Whether the position was on the passed delimiter or not. The index, line number, and content index ref variables will be updated only if true.
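
The behavior described above can be sketched as a standalone function. This is an illustration under stated assumptions rather than the module's code: try_to_skip_string is a hypothetical name, and it takes the token arrayref as an explicit first argument where the real method would read it from Tokens().

```perl
use strict;
use warnings;

# Hypothetical standalone version of the documented behavior.
sub try_to_skip_string {
    my ($tokens, $indexRef, $lineNumberRef, $openingDelimiter, $closingDelimiter,
        $startContentIndexRef, $endContentIndexRef) = @_;

    $closingDelimiter = $openingDelimiter if !defined $closingDelimiter;

    # Not on the delimiter: leave every variable untouched and report false.
    return 0 if $$indexRef >= @$tokens || $tokens->[$$indexRef] ne $openingDelimiter;

    my $index        = $$indexRef + 1;
    my $lineNumber   = $$lineNumberRef;
    my $startContent = $index;

    while ($index < @$tokens && $tokens->[$index] ne $closingDelimiter) {
        # A backslash escapes the next token, so step over the backslash first.
        $index++ if $tokens->[$index] eq '\\' && $index + 1 < @$tokens;

        # Line breaks are allowed inside the string but must still be counted.
        $lineNumber++ if $tokens->[$index] eq "\n";
        $index++;
    }

    my $endContent = $index;            # one past the last content token
    $index++ if $index < @$tokens;      # step past the closing delimiter

    $$indexRef             = $index;
    $$lineNumberRef        = $lineNumber;
    $$startContentIndexRef = $startContent if defined $startContentIndexRef;
    $$endContentIndexRef   = $endContent   if defined $endContentIndexRef;

    return 1;
}

# Tokens for the source text: "say \"hi\"";
my $tokens = [ '"', 'say', ' ', '\\', '"', 'hi', '\\', '"', '"', ';', "\n" ];
my ($index, $lineNumber) = (0, 1);
my ($start, $end);

if (try_to_skip_string($tokens, \$index, \$lineNumber, '"', undef, \$start, \$end)) {
    print "content: ", join('', @$tokens[$start .. $end - 1]), "\n";
    print "next token: $tokens->[$index]\n";
}
```

Working on local copies and writing back through the references only at the end is what makes "updated only if true" straightforward to guarantee.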

SkipUntilAfter

sub SkipUntilAfter #(indexRef, lineNumberRef, token, token, token ...)

Moves the position to the token following the next occurrence of a particular token sequence, or past the end of the tokens array if it never occurs. Useful for multiline comments.

Note that it skips blindly. It assumes there cannot be anything of interest, such as a string delimiter, between the position and the end of the line.

Parameters

indexRef	A reference to the position’s index.
lineNumberRef	A reference to the position’s line number.
token	A token that must be matched. Can be specified multiple times to match a sequence of tokens.
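
A blind skip over a token sequence might look like the sketch below. As before, skip_until_after is a hypothetical standalone function that takes the token arrayref explicitly; it is meant to illustrate the described behavior, not to reproduce the module's implementation.

```perl
use strict;
use warnings;

# Hypothetical standalone version of the documented behavior.
sub skip_until_after {
    my ($tokens, $indexRef, $lineNumberRef, @sequence) = @_;

    while ($$indexRef < @$tokens) {
        # Check whether the whole sequence starts at the current position.
        my $matched = ($$indexRef + @sequence <= @$tokens);
        for (my $i = 0; $matched && $i < @sequence; $i++) {
            $matched = 0 if $tokens->[$$indexRef + $i] ne $sequence[$i];
        }

        if ($matched) {
            # Count any line breaks that are part of the sequence itself.
            $$lineNumberRef += grep { $_ eq "\n" } @sequence;
            $$indexRef += @sequence;
            return;
        }

        # Blind skip: only line breaks are noticed, to keep the line number right.
        $$lineNumberRef++ if $tokens->[$$indexRef] eq "\n";
        $$indexRef++;
    }
}

# Tokens for a two-line comment followed by " x".
my $tokens = [ '/', '*', ' ', 'a', "\n", ' ', 'b', ' ', '*', '/', ' ', 'x' ];
my ($index, $lineNumber) = (2, 1);    # positioned just after the opening "/*"

skip_until_after($tokens, \$index, \$lineNumber, '*', '/');
print "index $index, line $lineNumber\n";    # prints "index 10, line 2"
```

If the sequence never occurs, the loop simply runs off the end, leaving the index equal to the number of tokens.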

Scope Functions

These functions provide a nice scope stack implementation for language-specific parsers to use. The default implementation makes the following assumptions.

Namespaces and packages completely replace one another, rather than concatenating. If you call SetPackage(), it completely replaces the previous package for the current scope. You need to concatenate manually if that’s the behavior you want.
Namespaces and packages inherit. So if a scope level doesn’t set its own, the namespace and package are the same as the parent scope’s.
Protection applies to the current level only and does not inherit. So if one is not set for the current scope level, CurrentProtection() will return undef rather than the parent scope’s value.
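
The three assumptions above can be demonstrated with a small scope stack. This sketch uses plain hashrefs and lower-case helper names, all of which are hypothetical; the real module stores NaturalDocs::Languages::Advanced::Scope objects and also maintains the scope record, which is omitted here.

```perl
use strict;
use warnings;

# Each scope level is a hashref holding its closing symbol and any settings.
my @scopeStack;

# There is always one level, with a symbol of undef, for the top level.
sub clear_scope_stack { @scopeStack = ({ symbol => undef }); }
sub start_scope       { my ($closingSymbol) = @_; push @scopeStack, { symbol => $closingSymbol }; }
sub end_scope         { pop @scopeStack if @scopeStack > 1; }

sub set_package    { $scopeStack[-1]{package}    = $_[0]; }
sub set_protection { $scopeStack[-1]{protection} = $_[0]; }

# Packages inherit: walk down the stack until a level defines one.
sub current_package {
    foreach my $scope (reverse @scopeStack) {
        return $scope->{package} if defined $scope->{package};
    }
    return undef;
}

# Protection does not inherit: only the current level is consulted.
sub current_protection { return $scopeStack[-1]{protection}; }

clear_scope_stack();
set_package('Outer');
set_protection('public');

start_scope('}');
my $inheritedPackage = current_package();       # taken from the parent level
my $innerProtection  = current_protection();    # not inherited from the parent
set_package('Outer::Inner');                    # replaces; concatenated by hand
my $innerPackage = current_package();
end_scope();

my $restoredPackage = current_package();

printf "inherited=%s protection=%s inner=%s restored=%s\n",
    $inheritedPackage,
    (defined($innerProtection) ? $innerProtection : 'undef'),
    $innerPackage,
    $restoredPackage;
# prints: inherited=Outer protection=undef inner=Outer::Inner restored=Outer
```

Storing settings per level and resolving them on lookup means that ending a scope restores the parent's values with no extra bookkeeping.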
