ResearchScienceSearchReconciledMetadata
ResearchSearch InfrastructureGoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchReconciledMetadata
SEO Analysis
AI GeneratedCore search serving infrastructure. While not a direct ranking signal, these systems process and serve search results. This model (Research Science Search Reconciled Metadata) contains SEO-relevant attributes including datasetClassificationScore. Key functionality includes: An identifier as provided by the dataset itself.
Actionable Insights for SEOs
- Monitor for changes in rankings that may correlate with updates to this system
- Consider how your content strategy aligns with what this signal evaluates
Attributes
60identifierFromSourcestringnilFull type: list(String.tAn identifier as provided by the dataset itself.
namestringnilFull type: list(String.tThe names of the dataset.
doistringnilFull type: String.tThe DOI for the dataset. We assume that there is only one.
dateUpdatedResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tMost recent of the three dates (published, created, modified)
datePublishedResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tThe date when the dataset was published.
alternateNamestringnilFull type: list(String.tAlternate names and acronyms for the dataset.
locationReconciledForNameboolean(nilIndicates if the location has been reconciled for the dataset name. This is used by LocationExtender to avoid re-annotating the dataset name.
fieldOfStudyResearchScienceSearchFieldOfStudyInfo →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchFieldOfStudyInfo.tField of study: a general, high-level classification of the dataset. This is only populated during indexing time and it is only populated if the classification_source is KNOWLEDGE_GRAPH or it's above inference threshold.
sameAsstringnilFull type: list(String.tIds for other instances (not different versions) of this dataset.
nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchLicense.tLicense for the dataset.
versionsSimhashstringnilFull type: String.tA simhash value of the fields used for identifying versions of a dataset. This will be used by the VersionClusterInfoWriter.
descriptionstringnilFull type: list(String.tDescription of the dataset.
coverageEndDateResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tisAccessibleForFreestringnilFull type: String.tIndicates if the dataset is available for free or behind a paywal http://schema.org/isAccessibleForFree
coverageStartDateResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tThe start and end date that the dataset covers. If the dataset covers a single timepoint, then start and end dates are the same. Use the ISO 8601 format for dates (e.g., 2006-05-23).
versionEmbeddingVectorlist(number(nilAn embedding for the dataset to be used by the VersionAggregator.
authorListstringnilFull type: String.tA string representation of the authors of the dataset, collected from author and creator in raw metadata. The exact format (e.g., comma-separated, etc.) is up to the extender that populates this field. The assumption is that this string may appear in the UI "as is".
dateCreatedResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tThe date when the dataset was created.
topSalientTermLabelstringnilFull type: list(String.tTop salient term labels that describe the dataset document body.
keywordstringnilFull type: list(String.tKeywords describing the dataset.
hasCroissantFormatboolean(nilIndicates if the dataset has croissant format (https://github.com/mlcommons/croissant). Use optional so that explicitly setting to false will ensure the value is passed along to the KG instead of being indistinguisable from being unset and thus not set in the KG.
denylistStatusstringnilFull type: list(String.tdatasetClassificationScorefloat(nilProbability that the entity is in fact a dataset (in contrast to spam or website labelled as dataset that does not describe a dataset).
languageCodestringnilFull type: String.tThe 2-letter language code for the source page for the dataset. Same as the language code in source_url_docjoin_info. Populated only when generating output for indexing.
sourceUrlDocjoinInfoResearchScienceSearchSourceUrlDocjoinInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo.tAll the information extracted from docjoin, for the source_url of this dataset, aka DatasetMetadata.source_url.
compactIdentifierFromCitationstringnilFull type: list(String.tCompact Identifier(s) extracted from the citation field. Like in the case of DOI(s) those identify the articles related to the dataset rather than the dataset itself.
mentionedUrlsstringnilFull type: list(String.tMentioned URLs in the description.
dateModifiedResearchScienceSearchDate →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDate.tThe date when the dataset was modified.
nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchOrganization.tFunder of the dataset.
variablestringnilFull type: list(String.tVariables that the data in the dataset captures (e.g., pressure, salinity, temperature). For now, these are just strings.
numberOfDatasetsAtSourceUrlinteger(nilThe number of datasets at the same source url as this dataset.
spatialCoverageResearchScienceSearchLocation →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchLocation.tLocations that describe spatial coverage of the data. If the data covers multiple locations then each value corresponds to one such location, describing its coordinates, mid, etc.
sourceOrganizationResearchScienceSearchOrganization →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchOrganization.tSource of the dataset: unifies provider, creator, author, publisher etc.
doiFromCitationstringnilFull type: list(String.tDOI(s) extracted from the citation field. In contrast to the "doi" field these DOIs identify the articles related to the dataset rather than the dataset itself.
indexInClusterinteger(nilIndex of this dataset in its cluster of replicas.
dataDownloadResearchScienceSearchDataDownload →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchDataDownload.tThe dataset in downloadable form. There can be multiple data download entries for different file types.
scholarQuerystringnilFull type: String.tQuery string to send to Scholar to obtain the best approximation of citations to the dataset.
publicationResearchScienceSearchCitation →nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchCitation.tnilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchCatalog.tCatalog that this dataset is a part of.
isBasedOnstringnilFull type: list(String.tA resource (most likely another dataset) from which this dataset is derived or from which it is a modification or adaption. http://schema.org/isBasedOn
versionClusterInfoResearchScienceSearchVersionClusterInfo →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchVersionClusterInfo.tInformation on the version cluster that the dataset is a part of. This field is populated during the indexing time; the field is populated only if the dataset is part of a version cluster.
urlstringnilFull type: list(String.turls for the dataset, including doi.
nilFull type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchReplica.tThe info of replicas of this dataset.
isInferredboolean(nilIndicates whether the metadata was inferred using an ML model rather than from the schema.org fields. Use optional so that explicitly setting to false will ensure the value is passed along to the KG instead of being indistinguisable from being unset and thus not set in the KG. This field was originally non-optional; changing to optional is backwards compatible, but protos created prior to being optional won't have has_is_inferred() (go/proto-proposals/proto3-presence#wire-format-semantic-changes).
metadataTypestringnilFull type: String.tscholarlyArticleResearchScienceSearchScholarlyArticle →nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchScholarlyArticle.tFor tables and figures, contains all of the metadata for a scholarly article that was the source of this table or figure. This field is populated only if metadata_type is 'TABLE' or 'FIGURE'.
relatedArticleUrlstringnilFull type: String.tThe url for the article that (likely) describes this dataset.
basicFieldsHashstringnilFull type: String.tA hash of the fields copied by BasicMetadataExtender and the importers. See cs/research/science_search/backend/extender/basic_metadata_extender.h for the list of fields.
compactIdentifierstringnilFull type: list(String.tCompact Identifiers (for example "RRID:SCR_002088") that can be resolved by Identifiers.org or N2T.net meta-resolvers.
imageUrlstringnilFull type: list(String.tThe image urls provided by the dataset (e.g., for thumbnail images).
licenseDeprecatedstringnilFull type: list(String.tLicense for the dataset. DEPRECATED
versionEmbeddingFieldsHashstringnilFull type: String.tA hash of the raw metadata fields used by the VersionEmbeddingExtender.
hasTableSummariesboolean(nilIndicates if the dataset has table summaries. This field is only populated during indexing time.
numberOfScholarCitationsinteger(nilThe number of articles that reference this dataset.
idstringnilFull type: String.tA unique id for the dataset. For the data from Spore, this is the spore id, such as, for example "http://accession.nodc.noaa.gov/8500223#__sid=js0" REQUIRED
measurementTechniquestringnilFull type: list(String.tA technique or technology used in a Dataset corresponding to the method used for measuring the corresponding variable(s) (described using variableMeasured). http://schema.org/measurementTechnique
sourceUrlstringnilFull type: String.tSource url from which we gathered the metadata
fingerprintstringnilFull type: String.tThe fingerprint of basic fields from DatasetMetadata, including: - name - description DEPRECATED
descriptionInHtmlstringnilFull type: list(String.tDescription of the dataset converted to HTML.
datasetClassificationFieldsHashstringnilFull type: String.tA hash of the raw metadata fields used by the QualityExtender.