CompositeDoc
Composite DocDocument ProcessingGoogleApi.ContentWarehouse.V1.Model.CompositeDoc
SEO Analysis
AI GeneratedPart of Google's Composite Document system, which brings together all known information about a URL into a single unified document representation. This includes content, links, quality signals, and metadata from multiple sources. The composite document is the complete picture Google has of a page and serves as the input for ranking algorithms.
Actionable Insights for SEOs
- Monitor for changes in rankings that may correlate with updates to this system
- Consider how your content strategy aligns with what this signal evaluates
Attributes
44localizedvariationsCompositeDocLocalizedVariations →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocLocalizedVariations.tlocalizedAlternateNameIndexingConverterLocalizedAlternateName →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.IndexingConverterLocalizedAlternateName.tLocalized alternate names are similar to alternate names, except that it is associated with a language different from its canonical. This is the subset of webmaster-provided localized alternate names being in the dup cluster of this document. Used during serving for swapping in the URL based on regional and language preferences of the user.
forwardingdupCompositeDocForwardingDup →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.CompositeDocForwardingDup.tperDocDataPerDocData →nilFull type: GoogleApi.ContentWarehouse.V1.Model.PerDocData.taccessRequirementsIndexingPrivacyAccessAccessRequirements →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingPrivacyAccessAccessRequirements.tContains necessary information to enforce row level Docjoin access control.
propertiesDocProperties →nilFull type: GoogleApi.ContentWarehouse.V1.Model.DocProperties.tbadSslCertificateIndexingBadSSLCertificate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingBadSSLCertificate.tThis field is present iff the page has a bad SSL certificate itself or in its redirect chain.
extradupCompositeDocExtraDup →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.CompositeDocExtraDup.tsubindexidstringnilFull type: list(String.tptokenPtokenPToken →nilFull type: GoogleApi.ContentWarehouse.V1.Model.PtokenPToken.tContains information necessary to perform policy decision on the usage of the data assosiated with this cdoc.
richcontentDataIndexingConverterRichContentData →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingConverterRichContentData.tIf present, indicates that some content was inserted, deleted, or replaced in the document's content (in CompositeDoc::doc::Content::Representation), and stores information about what was inserted, deleted, or replaced.
scaledIndyRankinteger(nilto copy to per-doc
nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.QualityProseCSEUrlInfo.tindexingIntermediatestringnilFull type: String.tSerialized indexing intermediate data.
docVideosImageRepositoryVideoProperties →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ImageRepositoryVideoProperties.tInfo about videos embedded in the document.
indexinginfoCompositeDocIndexingInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocIndexingInfo.tlocalinfoLocalWWWInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.LocalWWWInfo.tstorageRowTimestampMicrosstringnilFull type: String.tRow timestamp in CDoc storage.
partialUpdateInfoCompositeDocPartialUpdateInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocPartialUpdateInfo.tOnly present in partial cdocs.
additionalchecksumsCompositeDocAdditionalChecksums →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocAdditionalChecksums.tanchorsAnchors →nilFull type: GoogleApi.ContentWarehouse.V1.Model.Anchors.tMark as non-personal since no personal fields will be populated in anchors.link_additional_info and anchors.additional_info. For more details of Search personal data, see go/dma52-search-cdoc-fields.
registrationinfoRegistrationInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.RegistrationInfo.tInformation about the most recent creation and expiration of this domain. It's extracted from domainedge signal.
docAttachmentsProto2BridgeMessageSet →nilFull type: GoogleApi.ContentWarehouse.V1.Model.Proto2BridgeMessageSet.tA generic container to hold document annotations and signals. For a full list of extensions live today, see go/wde.
docinfoPassthroughAttachmentsProto2BridgeMessageSet →nilFull type: GoogleApi.ContentWarehouse.V1.Model.Proto2BridgeMessageSet.tThis message set is used for data pushed into the index using the signals framework that is never to be used in Mustang or TG Continuum scoring/snippeting code. Any protocol buffer stored in this message set is automatically returned in a docinfo response - it ends up in the "info" message set in the WWWSnippetResponse, so it can be used in post-doc twiddlers and for display in GWS with no code changes in Mustang or Teragoogle.
docImagesImageData →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ImageData.tInfo about "selected" images associated with the document for which we (already) have ImageData. For each image URL, some fixed number of documents are selected as web referrers for the image URL, and within those selected documents, we say the image is "selected". Within the remaining documents, we say the image is "rejected". Note that this distinction is slightly different from selected for indexing. Only images within doc_images where is_indexed_by_imagesearch is true will be selected for indexing. You can find the rejected images at composite_doc.doc_attachments().get(). You can find images that are selected, but for which we have no ImageData (yet) at composite_doc.image_indexing_info().selected_not_indexed_image_link()
includedcontentCompositeDocIncludedContent →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.CompositeDocIncludedContent.tporninfoClassifierPornDocumentData →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ClassifierPornDocumentData.tPorn related data used for image and web search porn classification as well as for diagnostics purposes.
urldatestringnilFull type: String.tDate in the url extracted by quality/snippets/urldate/date-in-url.cc This is given as midnight GMT on the date in question.
embeddedContentInfoIndexingEmbeddedContentEmbeddedContentInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingEmbeddedContentEmbeddedContentInfo.tData produced by the embedded-content system. This is a thin message, containing only embedded_links_info data for the embedder and JavaScript/CSS embedded links (the embedded-content bigtable also contains snapshots, compressed document trees and all embedded link types). Provided using the index signal API.
richsnippetRichsnippetsPageMap →nilFull type: GoogleApi.ContentWarehouse.V1.Model.RichsnippetsPageMap.trich snippet extracted from the content of a document.
qualitysignalsCompositeDocQualitySignals →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocQualitySignals.tanchorStatsIndexingDocjoinerAnchorStatistics →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingDocjoinerAnchorStatistics.tMark as non-personal since it's an aggregation of anchors. For more details of Search personal data, see go/dma52-search-cdoc-fields.
alternatenameCompositeDocAlternateName →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.CompositeDocAlternateName.tdataVersionIndexingDocjoinerDataVersion →nilFull type: GoogleApi.ContentWarehouse.V1.Model.IndexingDocjoinerDataVersion.tContains the tracking version of various data fields in CompositeDoc.
syntacticDateQualityTimebasedSyntacticDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.QualityTimebasedSyntacticDate.turlstringnilFull type: String.tWARNING!!! "url" field in CompositeDoc is optional, and is usually missing: e.g., Docjoin CompositeDoc's don't have CompositeDoc::url. has_url() checking is often useful. So don't rely on CompositeDoc::url unless you're sure otherwise. Usually you want to use CompositeDoc::doc::url instead.
docjoinsOnSpannerCommitTimestampMicrosstringnilFull type: String.tThe commit timestamp of a CDoc update to Docjoins on Spanner.
ContentChecksum96stringnilFull type: String.tVisible content checksum as computed by repository::parsehandler::checksum::Checksum96bitsParseHandler. The value is a Fprint96 in "key format" (i.e., by Fprint96::AsKey()).
liveexperimentinfoCompositeDocLiveExperimentInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocLiveExperimentInfo.tlabelDataQualityLabelsGoogleLabelData →nilFull type: GoogleApi.ContentWarehouse.V1.Model.QualityLabelsGoogleLabelData.tThis field associates a document to particular labels and assigns confidence values to them.
sitemapSitemap →nilFull type: GoogleApi.ContentWarehouse.V1.Model.Sitemap.tSitelinks: a collection of interesting links a user might be interested in, given they are interested in this document. WARNING: this is different from the crawler Sitemaps (see SitemapsSignals in the attachments).
robotsinfolistCompositeDocRobotsInfoList →nilFull type: GoogleApi.ContentWarehouse.V1.Model.CompositeDocRobotsInfoList.tnilFull type: GoogleApi.ContentWarehouse.V1.Model.GDocumentBase.tcsePagerankCutoffinteger(nilURL should only be selected for CSE Index if it's pagerank is higher than cse_pagerank_cutoff.