Parser (Java Platform SE 8 )

java.lang.Object
- javax.swing.text.html.parser.Parser

All Implemented Interfaces:

DTDConstants

已知直接子类：

DocumentParser
```
public class Parser
extends Object
implements DTDConstants
```
一个简单的DTD驱动的HTML解析器。解析器读取InputStream HTML文件和调用各种方法（这应该是在子类中重写）时遇到的标签和数据。
不幸的是，有许多好实现HTML解析器那里，因此有许多严重格式化的HTML文件。这个解析器试图解析多数HTML文件。这意味着实施有时偏离支持HTML的SGML规范。

解析器把R与\r\n \ n换行后starttags之前结束标记被忽略，正如在SGML和HTML规范规定。

HTML规范没有指定的空间都是很好的融合。具体来说，下面的场景没有讨论（请注意这里使用的空间，但我使用的是要显示的空间）：

“等等 <罢工> foo”可被视为：“ 等等 <罢工> foo

以及：“
< a href =“XX”> 用

”，似乎被视为：“
< a href =“XX”> 用 < / > ”

如果strict是虚假的，当一个标签，打破流，（TagElement.breaksFlows）或尾随空格时，所有的空格将被忽略，直到遇到一个非空白字符。这似乎给了更接近流行的浏览器的行为。

另请参见：

DTD， TagElement， SimpleAttributeSet

Field Summary

Fields
Modifier and Type	Field and Description
`protected DTD`	`dtd`
`protected boolean`	`strict` 这个标志决定是否执行SGML解析器将兼容性要求严格。

Fields inherited from interface javax.swing.text.html.parser.DTDConstants
ANY, CDATA, CONREF, CURRENT, DEFAULT, EMPTY, ENDTAG, ENTITIES, ENTITY, FIXED, GENERAL, ID, IDREF, IDREFS, IMPLIED, MD, MODEL, MS, NAME, NAMES, NMTOKEN, NMTOKENS, NOTATION, NUMBER, NUMBERS, NUTOKEN, NUTOKENS, PARAMETER, PI, PUBLIC, RCDATA, REQUIRED, SDATA, STARTTAG, SYSTEM

构造方法摘要

构造方法

Constructor and Description

Parser(DTD dtd)

构造方法
Constructor and Description
`Parser(DTD dtd)`

方法摘要

所有方法接口方法具体的方法
Modifier and Type	Method and Description
`protected void`	`endTag(boolean omitted)` 处理结束标签。
`protected void`	`error(String err)`
`protected void`	`error(String err, String arg1)`
`protected void`	`error(String err, String arg1, String arg2)`
`protected void`	`error(String err, String arg1, String arg2, String arg3)` 调用错误处理程序。
`protected void`	`flushAttributes()`
`protected SimpleAttributeSet`	`getAttributes()`
`protected int`	`getCurrentLine()`
`protected int`	`getCurrentPos()`
`protected void`	`handleComment(char[] text)` 当一个HTML注释时。
`protected void`	`handleEmptyTag(TagElement tag)` 遇到一个空标记时调用。
`protected void`	`handleEndTag(TagElement tag)` 当遇到一个结束标记时调用。
`protected void`	`handleEOFInComment()`
`protected void`	`handleError(int ln, String msg)` 发生了一个错误。
`protected void`	`handleStartTag(TagElement tag)` 当遇到一个开始标记时调用。
`protected void`	`handleText(char[] text)` 打电话时遇到85。
`protected void`	`handleTitle(char[] text)` 当一个HTML标题标签是遇到。
`protected TagElement`	`makeTag(Element elem)`
`protected TagElement`	`makeTag(Element elem, boolean fictional)` 让tagelement。
`protected void`	`markFirstTime(Element elem)` 标记在文档中被看到的第一个标签
`void`	`parse(Reader in)` 解析HTML流，给定一个DTD。
`String`	`parseDTDMarkup()` 分析了文档声明式标记声明。
`protected boolean`	`parseMarkupDeclarations(StringBuffer strBuff)` 解析标记声明。
`protected void`	`startTag(TagElement tag)` 处理开始标签。

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail
- dtd
```
protected DTD dtd
```
- strict
```
protected boolean strict
```
  这个标志决定是否执行SGML解析器将兼容性要求严格。如果是错的，它将随着错误的HTML中的某些常见的类宽松的构建。严格或不严格，在任何一种情况下，将被记录的错误。

Constructor Detail
- Parser
```
public Parser(DTD dtd)
```

方法详细信息

getCurrentLine
```
protected int getCurrentLine()
```
结果

当前正在解析的行的行数

makeTag

protected TagElement makeTag(Element elem,
                             boolean fictional)

让tagelement。

makeTag

protected TagElement makeTag(Element elem)

getAttributes

protected SimpleAttributeSet getAttributes()

flushAttributes
```
protected void flushAttributes()
```

handleText

protected void handleText(char[] text)

打电话时遇到85。

handleTitle
```
protected void handleTitle(char[] text)
```
当一个HTML标题标签是遇到。

handleComment

protected void handleComment(char[] text)

当一个HTML注释时。

handleEOFInComment
```
protected void handleEOFInComment()
```

handleEmptyTag

protected void handleEmptyTag(TagElement tag)
                       throws ChangedCharSetException

遇到一个空标记时调用。

异常: ChangedCharSetException

handleStartTag
```
protected void handleStartTag(TagElement tag)
```
当遇到一个开始标记时调用。

handleEndTag
```
protected void handleEndTag(TagElement tag)
```
当遇到一个结束标记时调用。

handleError

protected void handleError(int ln,
                           String msg)

发生了一个错误。

error

protected void error(String err,
                     String arg1,
                     String arg2,
                     String arg3)

调用错误处理程序。

error

protected void error(String err,
                     String arg1,
                     String arg2)

error

protected void error(String err,
                     String arg1)

error
```
protected void error(String err)
```

startTag
```
protected void startTag(TagElement tag)
                 throws ChangedCharSetException
```
处理开始标签。新的标签被推到标签堆栈上。属性列表检查所需的属性。

异常

ChangedCharSetException

endTag
```
protected void endTag(boolean omitted)
```
处理结束标签。从标签堆栈中弹出结束标记。

markFirstTime
```
protected void markFirstTime(Element elem)
```
标记在文档中被看到的第一个标签

parseDTDMarkup
```
public String parseDTDMarkup()
                      throws IOException
```
分析了文档声明式标记声明。目前忽略它。

异常

IOException

parseMarkupDeclarations
```
protected boolean parseMarkupDeclarations(StringBuffer strBuff)
                                   throws IOException
```
解析标记声明。目前只处理文档类型声明标记。返回真，如果它是一个标记声明，否则为假。

异常

IOException

parse

public void parse(Reader in)
           throws IOException

解析HTML流，给定一个DTD。

异常: IOException

getCurrentPos
```
protected int getCurrentPos()
```

Submit a bug or feature
For further API reference and developer documentation, see Java SE Documentation. That documentation contains more detailed, developer-targeted descriptions, with conceptual overviews, definitions of terms, workarounds, and working code examples.
Copyright © 1993, 2014, Oracle and/or its affiliates. All rights reserved.

Class Parser

Field Summary

Fields inherited from interface javax.swing.text.html.parser.DTDConstants

构造方法摘要

方法摘要

Methods inherited from class java.lang.Object

Field Detail

dtd

strict

Constructor Detail

Parser

方法详细信息

getCurrentLine

makeTag

makeTag

getAttributes

flushAttributes

handleText

handleTitle

handleComment

handleEOFInComment

handleEmptyTag

handleStartTag

handleEndTag

handleError

error

error

error

error

startTag

endTag

markFirstTime

parseDTDMarkup

parseMarkupDeclarations

parse

getCurrentPos