IndexingEmbeddedContentEmbeddedContentInfo


GoogleApi.ContentWarehouse.V1.Model.IndexingEmbeddedContentEmbeddedContentInfo

SEO Impact: 7 out of 10 (High)
This protobuf is used (1) to pass data between EmbeddedExporter and the publisher, and (2) as a member of CompositeDoc, to stick embedded content output into the docjoins. Next tag available: 21.

SEO Analysis

AI Generated

Controls how pages are indexed. Without proper indexing, pages cannot appear in search results at all. This model (Indexing Embedded Content Embedded Content Info) contains SEO-relevant attributes including renderedSnapshotQualityScore. Key functionality includes: The document's DOM and render tree produced by WebKit as a side effect of rendering the page. It might be compressed or not. Thus, use indexing::embed...

Actionable Insights for SEOs

  • Monitor for changes in rankings that may correlate with updates to this system
  • Consider how your content strategy aligns with what this signal evaluates
  • Optimize crawl budget by fixing broken links and reducing redirect chains
  • Use robots.txt and sitemap.xml effectively to guide crawling
  • Monitor Google Search Console for crawl errors and indexing issues

Attributes

14 attributes
compressedDocumentTrees (string)
Default: nil | Full type: String.t

The document's DOM and render tree produced by WebKit as a side effect of rendering the page. It might be compressed or not. Thus, use indexing::embedded_content::UncompressWebkitDocument to decode it.
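
Because the payload may or may not be compressed, a reader has to handle both cases. The real helper, indexing::embedded_content::UncompressWebkitDocument, is internal and its compression scheme is not documented here. The sketch below only illustrates the "try to decompress, otherwise pass through" pattern, and it assumes zlib purely as a stand-in. The function name decode_document_trees is hypothetical.

```python
import zlib


def decode_document_trees(payload: bytes) -> bytes:
    """Return uncompressed bytes whether or not `payload` was compressed.

    Assumption: zlib stands in for the real, undocumented compression scheme.
    """
    try:
        return zlib.decompress(payload)
    except zlib.error:
        # Not a valid zlib stream, so treat the payload as already uncompressed.
        return payload
```

Callers can then pass either form of the field to the same function without checking first.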

convertedContents (string)
Default: nil | Full type: String.t

The converted contents, as produced by the same DocumentUpdater transaction that generated the render tree. Useful whenever one of our users wants to experiment with deriving an annotation from the render tree.

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.IndexingEmbeddedContentEmbeddedLinksInfo.t

Information about all external resources needed to render this page, a.k.a. embedded links. This includes .css files, images embedded in a page, external JavaScript files, iframes, etc.
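
To see which embedded links a page of your own depends on, you can list them from the HTML. The following is a minimal sketch using Python's standard html.parser. It is not how the internal pipeline collects this data, and the names EmbeddedLinkCollector and collect_embedded_links are hypothetical. It only covers the resource kinds named above (stylesheets, images, scripts, iframes) and does not follow CSS imports or resources loaded by JavaScript.

```python
from html.parser import HTMLParser

# Tag name -> attribute that carries the resource URL.
RESOURCE_ATTRS = {"img": "src", "script": "src", "iframe": "src"}


class EmbeddedLinkCollector(HTMLParser):
    """Collects (kind, url) pairs for resources a page needs to render."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "link":
            # Only <link rel="stylesheet"> counts as a render dependency here.
            rel = (attr_map.get("rel") or "").lower().split()
            if "stylesheet" in rel and attr_map.get("href"):
                self.links.append(("stylesheet", attr_map["href"]))
        elif tag in RESOURCE_ATTRS:
            url = attr_map.get(RESOURCE_ATTRS[tag])
            if url:  # Skips inline <script> blocks that have no src.
                self.links.append((tag, url))


def collect_embedded_links(html: str):
    collector = EmbeddedLinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links
```

Each URL in the result is a candidate to check for crawlability, since a blocked or failing resource can change how the page renders.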

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.HtmlrenderWebkitHeadlessProtoRenderResponse.t

The headless response for rendering the document.

isAlternateSnapshot (boolean)
Default: nil

Indicates whether the snapshot was generated from an alternate snapshot. If true, the snapshot will be exported even if the snapshot quality score is low.

originalEncoding (integer)
Default: nil

The original encoding of the content crawled from trawler, stored as the value of the enum i18n::encodings::encoding. An int32 is used here instead of the encoding proto to maintain compatibility with "py_api_version = 1".

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.IndexingConverterRawRedirectInfo.t

DEPRECATED: This field is only populated in fresh_doc, which is shutting down.

Default: nil | Full type: list(GoogleApi.ContentWarehouse.V1.Model.HtmlrenderWebkitHeadlessProtoReferencedResource.t)

Information about all external resources used to render this page, a.k.a. embedded links. This includes .css files, images embedded in a page, external JavaScript files, iframes, etc.

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.HtmlrenderWebkitHeadlessProtoImage.t

Only exists in dry run mode.

renderedSnapshotImage (string)
Default: nil | Full type: String.t

Snapshot image of a rendered HTML document (possibly encoded as PNG, JPEG, or WebP).
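
Since the encoding is not fixed, a consumer needs to identify the format before decoding the image. A minimal sketch, assuming the field's raw bytes are already in hand (how the bytes are carried in the string field is not stated here), is to check the well-known file signatures of the three formats. The function name sniff_image_format is hypothetical.

```python
def sniff_image_format(data: bytes) -> str:
    """Guess the image format from its leading magic bytes."""
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if data.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    # WebP is a RIFF container: "RIFF", a 4-byte size, then "WEBP".
    if len(data) >= 12 and data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "webp"
    return "unknown"
```

The check reads only the first 12 bytes, so it can run before the full image is decoded.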

renderedSnapshotMetadata (SnapshotSnapshotMetadata)
Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.SnapshotSnapshotMetadata.t

A collection of values which are needed by the users of the Kodachrome bigtable.

renderedSnapshotQualityScore (float)
Default: nil

The quality of the image, 0.0 is the worst, 1.0 is the best. If all dependencies are successfully crawled, the quality should be 1.0. If one or more of the dependencies are unknown, the quality will be lower.
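
Taken together with isAlternateSnapshot, this score implies a simple export rule: alternate snapshots are always exported, while other snapshots depend on the score. The sketch below illustrates that rule only. The actual threshold is not documented, so MIN_EXPORT_QUALITY is a made-up value, and SnapshotInfo and should_export are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical cutoff; the real threshold is not given in the source.
MIN_EXPORT_QUALITY = 0.5


@dataclass
class SnapshotInfo:
    rendered_snapshot_quality_score: Optional[float] = None
    is_alternate_snapshot: Optional[bool] = None


def should_export(info: SnapshotInfo, min_quality: float = MIN_EXPORT_QUALITY) -> bool:
    """Alternate snapshots bypass the quality check; others must meet it."""
    if info.is_alternate_snapshot:
        return True
    score = info.rendered_snapshot_quality_score
    return score is not None and score >= min_quality
```

For example, should_export(SnapshotInfo(0.2, True)) is True despite the low score, while should_export(SnapshotInfo(0.2, False)) is False under the assumed cutoff.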

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.IndexingEmbeddedContentRenderingOutputMetadata.t

Default: nil | Full type: GoogleApi.ContentWarehouse.V1.Model.IndexingConverterRichContentData.t

The rich content data to recover the original contents from the converted_contents. Useful for offline content analysis.