RepositoryWebrefDocumentMetadata

Repository WebrefEntity Understanding

GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefDocumentMetadata

8
out of 10
High
SEO Impact
Information about the document which is not produced by webref, typically copied from the docjoin. Next available tag: 15

SEO Analysis

AI Generated

Part of Google's entity understanding system (WebRef). This system identifies and links entities—people, places, things, concepts—within web content to Google's Knowledge Graph. Entity recognition is fundamental to semantic search, helping Google understand content meaning beyond keywords and match it to relevant queries.

Actionable Insights for SEOs

  • Monitor for changes in rankings that may correlate with updates to this system
  • Consider how your content strategy aligns with what this signal evaluates
  • Build topical authority through comprehensive, entity-focused content
  • Implement structured data to help Google understand entities on your pages
  • Create content that clearly establishes entity relationships

Attributes

11
Sort:|Filter:
crawlTimestring
Default: nilFull type: String.t

The timestamp of when the document was crawled (if known). Copied from CompositeDoc.Content.CrawlTime.

docFpstring
Default: nilFull type: String.t

Fingerprint of the document. We compute and set this fingerprint when creating the pagesets that we use for evals. Otherwise, this field is not normally set. We use the field to make sure that the human ratings that we have are generated for the same version of the document, otherwise they might be invalid. We do not compute the fingerprint on the fly (e.g. as a fingerprint of the proto buffer serialization of the cdoc) because protocol buffer serialization is not stable.

docIdstring
Default: nilFull type: String.t

DocId of the annotated document as read from cdoc.doc().docid().

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefForwardingUrls.t

Urls that forward to this url. Needed for url -> topical entity entries.

isDisambiguationPageboolean(
Default: nil

Set to true if the document is a known disambiguation page, e.g. https://en.wikipedia.org/wiki/Orange.

languagestring
Default: nilFull type: String.t

The document language, as read from doc().content().language(). This is go/language-enum value.

numIncomingAnchorsnumber(
Default: nil

The (weighted) number of incoming anchors (links from other documents).

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet.t

The salient terms for this document. Only set if --webref_doc_metadata_copy_salient_terms is true. Same motivation as the title field above.

titlestring
Default: nilFull type: String.t

The title of the document. Only set if --webref_doc_metadata_set_title is true. The idea is that we can use this to more easily learn things like: title contains "restaurants" -> more likely to be a list page.

totalClicksnumber(
Default: nil

The total clicks on this document, taken from navboost data.

urlstring
Default: nilFull type: String.t

The url of the document.