ResearchScamGenericFeatureVector

Research SCAMSearch Infrastructure

GoogleApi.ContentWarehouse.V1.Model.ResearchScamGenericFeatureVector

4
out of 10
Low
SEO Impact
===================================================================== # GenericFeatureVector Last tag used: 23

SEO Analysis

AI Generated

Core search serving infrastructure. While not a direct ranking signal, these systems process and serve search results. This model (Research Scam Generic Feature Vector) contains SEO-relevant attributes including weight. Key functionality includes: The class label of this datapoint. This should be populated if ScaM is being used for nearest-neighbor-based classification.

Actionable Insights for SEOs

  • Understanding this model helps SEOs grasp Google's internal data architecture
  • Consider how this system might interact with other ranking signals

Attributes

19
Sort:|Filter:
classLabelstring
Default: nilFull type: String.t

The class label of this datapoint. This should be populated if ScaM is being used for nearest-neighbor-based classification.

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScamGenericFeatureVectorCrowding.t
dataIdStrstring
Default: nilFull type: String.t

Optional point id that can contain an arbitrary (unrestricted in content) value except when the data is provided via SSTable (sharded or not). In the case of SSTable the dataid_str must be set for _all points in all shards or the SSTable keys will be used as the values for the data_id_str of the respective points. In either SSTable case--data_id_str provided explicitly or via the key--the values must be unique across all shards.

expirationTimestampdatetime
Default: nilFull type: DateTime.t

A timestamp after which this datapoint is considered no longer valid and is eligible for deletion. The exact meaning varies with application/configuration.

featureDimstring
Default: nilFull type: String.t

- Optionally explicitly specifies the dimensionality of this vector. For dense vectors, this is ignored and, dimensionality is inferred from the number of values specified, and must be identical to this, or unspecified. For sparse vectors, the default value is correct for most users, and allows use of 64-bit hash values for feature indices. If you're using sparse vectors with a limited range of dimension indices and want to do K-Means clustering with them, you may want to specify this explicitly.

featureIndexstring
Default: nilFull type: list(String.t

- for SPARSE vectors, specifies indices of the nonzero dimensions whose values are specified by the , , or field. This field is not used when specifying dense vectors.

featureTypestring
Default: nilFull type: String.t

Describes the type of feature values.

featureValueDoublelist(float(
Default: nil
featureValueFloatlist(number(
Default: nil

Actual feature vector. Only one of the following should be populated. This list has to be kept in sync with FeatureType enum. NOTES: Binary features are stored as ones or zeroes in feature_value_int64. Floating point values (feature_value_float, feature_value_double) may not be NaN.

featureValueInt64string
Default: nilFull type: list(String.t
featureValueStringstring
Default: nilFull type: String.t
Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScamGenericFeatureVectorFixedPointMetadata.t
internalOnlyIsSecondaryPartitionboolean(
Default: nil

copybara:strip_begin INTERNAL USE ONLY! The ScaM Team reserves the right to remove this field and reuse its proto tag without notice. DO NOT USE outside of build_shards_pipeline! This field is used inside build_shards_pipeline to indicate whether this GFV is residing in its secondary (vs. primary) partition. copybara:strip_end

normTypestring
Default: nilFull type: String.t

Describes if data has been normalized and the type.

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScamQueryMetadata.t

copybara:strip_begin A field that contains metadata information when the datapoint is acting as a query. copybara:strip_end

Default: nilFull type: GoogleApi.ContentWarehouse.V1.Model.ResearchScamGenericFeatureVectorRestrictTokens.t

copybara:strip_end_and_replace reserved 15;

tokenslist(integer(
Default: nil
userinfostring
Default: nilFull type: String.t

This field allows application-specific metadata to be stored in a GFV. This information may be used by custom binaries or in pre- or postprocessing outside of ScaM. Use cases include but are not limited to: Dataset IDs, if multiple datasets are multiplexed into one physical file or network location. An alternative, possibly more human-readable representation of the data represented by this GFV, for e.g. debugging purposes. * Outputting the contents of this field verbatim to the metadata field of the NearestNeighbors.Neighbor proto.

weightnumber(
Default: nil

DEPRECATED fields. Ignored by ScaM binaries. Do not use.