ResearchScienceSearchSourceUrlDocjoinInfo

ResearchSearch Infrastructure

GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo

6
out of 10
Medium
SEO Impact
The proto containing all the information we extracted from docjoin, for the source_url of the dataset. NEXT TAG: 18

SEO Analysis

AI Generated

Core search serving infrastructure. While not a direct ranking signal, these systems process and serve search results. This model (Research Science Search Source Url Docjoin Info) contains SEO-relevant attributes including navboostQuery, pagerank, pagerankNs. Key functionality includes: The url used to display in the google search results.

Actionable Insights for SEOs

  • Understanding this model helps SEOs grasp Google's internal data architecture
  • Consider how this system might interact with other ranking signals

Attributes

17
Sort:|Filter:
dataSourcestring
Default: nilFull type: String.t
displayUrlstring
Default: nilFull type: String.t

The url used to display in the google search results.

docidstring
Default: nilFull type: String.t

The docid of the document.

indexTierstring
Default: nilFull type: list(String.t

Index tiers (BASE, UNIFIED_ZEPPELIN, etc) that the document belongs to. NOTE: Each document may belong to multiple tiers. NOTE: The original data type is an enum CompositeDoc::SubIndexType. However we don't want to depend on segindexer/compositedoc.proto because the proto is too large. Instead, we use CompositeDoc::SubIndexType_Name( subindexid) to convert into a string representation. To convert string back to CompositeDoc::SubIndexType, use CompositeDoc::SubIndexType_Parse.

languageCodestring
Default: nilFull type: String.t

The language of the document in the string representation of LanguageCode. Converts from Language Enum to LanguageCode through i18n/identifiers/langenclanguagecodeconverter.h Please use i18n/identifiers/languagecodeconverter.h for converting between LanguageCode and string representation.

latestPageUpdateDatestring
Default: nilFull type: String.t

The syntactic date of a dataset document that reflects the publication date of the content.

Default: nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchNavboostQueryInfo.t

A sequence of Navboost queries for the dataset source_url.

pagerankinteger(
Default: nil

The page rank of the document. DEPRECATED in favour of Pagerank_NS. Do not use as it is no longer maintained in docjoins and can break at any moment.

pagerankNsinteger(
Default: nil

The production pagerank value of the document.

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification.t

Petacat classifications for the web document. Normally the results from calling Petacat come in a PetacatResponse, which is very flexible and extensible. This proto takes most of the flexibility away - only rephil clusters, taxonomic classifications, and binary classifications, with discretized weights.

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet.t

A set of salient terms extracted fromthe document. DEPRECATEAD. Moved to DatasetMetadata for performance reasons.

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal.t

Science per-doc data for inclusion in websearch.

sporeGraphMidstring
Default: nilFull type: list(String.t

A set of entities from WebRef annotations that are in SPORE_GRAPH.

titlestring
Default: nilFull type: String.t

The title of the document.

Default: nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity.t

A set of top entities from WebrefAnnotation, top is defined by topicality score, see go/topicality-score for detail. DEPRECATED. See label_to_mids_map instead.

urlstring
Default: nilFull type: String.t

The url of the document.

Default: nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo.t

A set of entities copied from WebRefEntities on cDoc.