Abstract base class for document parsers.
|C#||Visual Basic||Visual C++|
public abstract class Parser
Public MustInherit Class Parser
public ref class Parser abstract
Gets the instance of the Configuration class that holds the settings to be used.
The character encoding used in the document Stream, if applicable.
Determines whether the specified(Inherited from is equal to the current . .)
Allows an(Inherited from to attempt to free resources and perform other cleanup operations before the is reclaimed by garbage collection. .)
Creates a footer with filename info from the Uri
Serves as a hash function for a particular type.(Inherited from .)
Returns the next 'word' in rawBody, is iterative, so subsequent calls move to consecutive words.
Gets the(Inherited from of the current instance. .)
Returns list of words as strings in an ArrayList, that are in the Uri
Returns whether the word last returned by GetNextWord is part of the title.
Determines whether current word (at wordStart) is in an ignored region.
Whether the parser would need a stream to be passed to it in order to perform a ReadText or ReadLinks operation.
Creates a shallow copy of the current(Inherited from . .)
|ParseWords(String, ArrayList, WordCollection, StringBuilder, ArrayList)|
Parses rawBody into descrete Word objects and places them in readDocumentWords.
Applies any required processing to a chunk of text that typically forms either a word or whitespace block.
Processes the list of all words found in the document and returns a list that should be index.
|Read(Stream, Uri, Encoding)|
Reads a document and returns an object holding it's text and any links.
|ReadLinks(Stream, Encoding)|| Obsolete.|
Reads links to other pages.
|ReadText(Stream, Uri, Encoding)|| Obsolete.|
Reads text and returns list of words and title
Resets the current word being processed.
Returns a(Inherited from that represents the current . .)
Removes repeated non-letters from word.
The current word's end.
The current word's start.
Assembly: Keyoti2.SearchEngine.Core (Module: Keyoti2.SearchEngine.Core) Version: 2010.4.1.609