NlpSaftToken
NLP, Infrastructure
GoogleApi.ContentWarehouse.V1.Model.NlpSaftToken
SEO Analysis
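AI Generated
Backend infrastructure with indirect SEO impact. This model (NlpSaftToken) contains 15 attributes that describe a single token: its word form and lemma, its byte offsets in the document text, its part-of-speech tag and tag confidence, its dependency head and label, its morphology, and whether the preceding break skipped over non-tag text (excluding script/style).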
Actionable Insights for SEOs
- Understanding this model helps SEOs grasp Google's internal data architecture
- Consider how this system might interact with other ranking signals
Attributes (15)
- breakLevel (string, default: nil). Full type: String.t.
- breakSkippedText (boolean, default: nil). Whether the break skipped over non-tag text (excluding script/style).
- category (string, default: nil). Full type: String.t. Coarse-grained word category for token. See README.categories for category inventory.
- end (integer, default: nil).
- head (integer, default: nil). Head of this token in the dependency tree: the id of the token which has an arc going to this one. If it is the root token of a sentence, then it is set to -1.
- [name not captured] (default: nil). Full type: GoogleApi.ContentWarehouse.V1.Model.Proto2BridgeMessageSet.t. Annotation for this token.
- label (string, default: nil). Full type: String.t. Label for dependency relation between this token and its head. See README.labels for label inventory.
- lemma (string, default: nil). Full type: String.t. Word lemma. This is only filled if the lemma is different from the word form.
- [name not captured] (default: nil). Full type: GoogleApi.ContentWarehouse.V1.Model.NlpSaftMorphology.t. Morphology information.
- scriptCode (string, default: nil). Full type: String.t. A string representation (typically four letters, sometimes longer) of the token's Unicode script code, based on BCP 47/CLDR, capitalized according to ISO 15924. See i18n/identifiers/scriptcode.h for details.
- start (integer, default: nil). [start, end] describe the inclusive byte range of the UTF-8 encoded token in document.text. End gives the index of the last byte, which may be a UTF-8 continuation byte, and the length in bytes is end - start + 1. begin/end options are for goldmine AnnotationsFinder to locate the offsets of saft tokens. Start is inclusive by default and end is marked.
- tag (string, default: nil). Full type: String.t. Part-of-speech tag for token. See README.tags for tag inventory.
- tagConfidence (number, default: nil). Confidence score for the tag prediction -- should be interpreted as a probability estimate that the tag is correct.
- textProperties (integer, default: nil).
- word (string, default: nil). Full type: String.t. Token word form. This may not be identical to the original. For example, in goldmine annotation we do UTF-8 normalization and punctuation normalization. The punctuation normalization includes inferring the directionality of straight doublequotes -- that is, we map " to open quote (``) or close quote (''), and sometimes we get it wrong. SAFT processing in other contexts (such as queries in qrewrite) involves different normalizations.
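
The conventions described for start, end, and head can be illustrated with a minimal Python sketch. It is not Google's implementation. The Token class, the helper names, and the sample sentence are hypothetical, and the sketch assumes that a head id is the position of the head token within the same token list, which the attribute description does not confirm.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Token:
    """Hypothetical stand-in for a few NlpSaftToken attributes."""
    word: str
    start: int  # index of the first byte of the token in the UTF-8 text
    end: int    # index of the LAST byte of the token (inclusive)
    head: int   # assumed: list position of the head token, -1 for the root


def token_length(token: Token) -> int:
    """Length in bytes, following the documented rule end - start + 1."""
    return token.end - token.start + 1


def token_bytes(text: str, token: Token) -> bytes:
    """Slice the inclusive byte range [start, end] out of the UTF-8 text."""
    data = text.encode("utf-8")
    # Python slices exclude the upper bound, so add 1 to the inclusive end.
    return data[token.start:token.end + 1]


def find_root(tokens: List[Token]) -> Optional[int]:
    """Return the position of the first token whose head is -1, if any."""
    for index, token in enumerate(tokens):
        if token.head == -1:
            return index
    return None


def path_to_root(tokens: List[Token], index: int) -> List[int]:
    """Follow head links from a token up to the root of its sentence."""
    path = [index]
    while tokens[path[-1]].head != -1:
        if len(path) > len(tokens):
            raise ValueError("cycle detected in head links")
        path.append(tokens[path[-1]].head)
    return path


# Sample data (hypothetical): the character U+00EB takes two bytes in UTF-8,
# so the first token spans bytes 0..3 even though it has three characters.
text = "Zo\u00eb naps"
tokens = [
    Token(word="Zo\u00eb", start=0, end=3, head=1),
    Token(word="naps", start=5, end=8, head=-1),
]

for position, token in enumerate(tokens):
    surface = token_bytes(text, token).decode("utf-8")
    print(position, surface, token_length(token), path_to_root(tokens, position))
```

Because the offsets count bytes rather than characters, slicing the decoded string directly with start and end would return the wrong span for any text containing multi-byte characters; the sketch encodes to UTF-8 first for that reason.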