Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 1 addition & 1 deletion .flake8
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@
[flake8]
# TODO(https://github.com/googleapis/gapic-generator-python/issues/2333):
# Resolve flake8 lint issues
ignore = E203, E231, E266, E501, W503, E704
ignore = E203, E231, E266, E501, W503
exclude =
# TODO(https://github.com/googleapis/gapic-generator-python/issues/2333):
# Ensure that generated code passes flake8 lint
Expand Down
2 changes: 1 addition & 1 deletion .github/.OwlBot.lock.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -13,4 +13,4 @@
# limitations under the License.
docker:
image: gcr.io/cloud-devrel-public-resources/owlbot-python:latest
digest: sha256:3b3a31be60853477bc39ed8d9bac162cac3ba083724cecaad54eb81d4e4dae9c
digest: sha256:4a9e5d44b98e8672e2037ee22bc6b4f8e844a2d75fcb78ea8a4b38510112abc6
Original file line number Diff line number Diff line change
Expand Up @@ -235,15 +235,15 @@ class AutoTransformation(proto.Message):
class NumericTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The value converted to float32.
- The z_score of the value.
- log(value+1) when the value is greater than or equal to 0.
Otherwise, this transformation is not applied and the value is
considered a missing value.
- z_score of log(value+1) when the value is greater than or equal
to 0. Otherwise, this transformation is not applied and the value
is considered a missing value.
- A boolean value that indicates whether the value is valid.
- The value converted to float32.
- The z_score of the value.
- log(value+1) when the value is greater than or equal to 0.
Otherwise, this transformation is not applied and the value is
considered a missing value.
- z_score of log(value+1) when the value is greater than or equal to
0. Otherwise, this transformation is not applied and the value is
considered a missing value.
- A boolean value that indicates whether the value is valid.

Attributes:
column_name (str):
Expand All @@ -268,13 +268,13 @@ class NumericTransformation(proto.Message):
class CategoricalTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The categorical string as is--no change to case, punctuation,
spelling, tense, and so on.
- Convert the category name to a dictionary lookup index and
generate an embedding for each index.
- Categories that appear less than 5 times in the training dataset
are treated as the "unknown" category. The "unknown" category
gets its own special lookup index and resulting embedding.
- The categorical string as is--no change to case, punctuation,
spelling, tense, and so on.
- Convert the category name to a dictionary lookup index and
generate an embedding for each index.
- Categories that appear less than 5 times in the training dataset
are treated as the "unknown" category. The "unknown" category gets
its own special lookup index and resulting embedding.

Attributes:
column_name (str):
Expand All @@ -289,13 +289,13 @@ class CategoricalTransformation(proto.Message):
class TimestampTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- Apply the transformation functions for Numerical columns.
- Determine the year, month, day,and weekday. Treat each value from
the
- timestamp as a Categorical column.
- Invalid numerical values (for example, values that fall outside
of a typical timestamp range, or are extreme values) receive no
special treatment and are not removed.
- Apply the transformation functions for Numerical columns.
- Determine the year, month, day,and weekday. Treat each value from
the
- timestamp as a Categorical column.
- Invalid numerical values (for example, values that fall outside of
a typical timestamp range, or are extreme values) receive no
special treatment and are not removed.

Attributes:
column_name (str):
Expand All @@ -304,15 +304,15 @@ class TimestampTransformation(proto.Message):
The format in which that time field is expressed. The
time_format must either be one of:

- ``unix-seconds``
- ``unix-milliseconds``
- ``unix-microseconds``
- ``unix-nanoseconds`` (for respectively number of seconds,
milliseconds, microseconds and nanoseconds since start of
the Unix epoch); or be written in ``strftime`` syntax. If
time_format is not set, then the default format is RFC
3339 ``date-time`` format, where ``time-offset`` =
``"Z"`` (e.g. 1985-04-12T23:20:50.52Z)
- ``unix-seconds``
- ``unix-milliseconds``
- ``unix-microseconds``
- ``unix-nanoseconds`` (for respectively number of seconds,
milliseconds, microseconds and nanoseconds since start of
the Unix epoch); or be written in ``strftime`` syntax. If
time_format is not set, then the default format is RFC
3339 ``date-time`` format, where ``time-offset`` = ``"Z"``
(e.g. 1985-04-12T23:20:50.52Z)
invalid_values_allowed (bool):
If invalid values is allowed, the training
pipeline will create a boolean feature that
Expand All @@ -337,15 +337,14 @@ class TimestampTransformation(proto.Message):
class TextTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The text as is--no change to case, punctuation, spelling, tense,
and so on.
- Tokenize text to words. Convert each words to a dictionary lookup
index and generate an embedding for each index. Combine the
embedding of all elements into a single embedding using the mean.
- Tokenization is based on unicode script boundaries.
- Missing values get their own lookup index and resulting
embedding.
- Stop-words receive no special treatment and are not removed.
- The text as is--no change to case, punctuation, spelling, tense,
and so on.
- Tokenize text to words. Convert each words to a dictionary lookup
index and generate an embedding for each index. Combine the
embedding of all elements into a single embedding using the mean.
- Tokenization is based on unicode script boundaries.
- Missing values get their own lookup index and resulting embedding.
- Stop-words receive no special treatment and are not removed.

Attributes:
column_name (str):
Expand All @@ -361,9 +360,9 @@ class NumericArrayTransformation(proto.Message):
r"""Treats the column as numerical array and performs following
transformation functions.

- All transformations for Numerical types applied to the average of
the all elements.
- The average of empty arrays is treated as zero.
- All transformations for Numerical types applied to the average of
the all elements.
- The average of empty arrays is treated as zero.

Attributes:
column_name (str):
Expand All @@ -389,11 +388,11 @@ class CategoricalArrayTransformation(proto.Message):
r"""Treats the column as categorical array and performs following
transformation functions.

- For each element in the array, convert the category name to a
dictionary lookup index and generate an embedding for each index.
Combine the embedding of all elements into a single embedding
using the mean.
- Empty arrays treated as an embedding of zeroes.
- For each element in the array, convert the category name to a
dictionary lookup index and generate an embedding for each index.
Combine the embedding of all elements into a single embedding
using the mean.
- Empty arrays treated as an embedding of zeroes.

Attributes:
column_name (str):
Expand All @@ -409,10 +408,10 @@ class TextArrayTransformation(proto.Message):
r"""Treats the column as text array and performs following
transformation functions.

- Concatenate all text values in the array into a single text value
using a space (" ") as a delimiter, and then treat the result as
a single text value. Apply the transformations for Text columns.
- Empty arrays treated as an empty text.
- Concatenate all text values in the array into a single text value
using a space (" ") as a delimiter, and then treat the result as a
single text value. Apply the transformations for Text columns.
- Empty arrays treated as an empty text.

Attributes:
column_name (str):
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ class ExportEvaluatedDataItemsConfig(proto.Message):

If not specified, then results are exported to the following
auto-created BigQuery table:
<project_id>:export_evaluated_examples_<model_name>_<yyyy_MM_dd'T'HH_mm_ss_SSS'Z'>.evaluated_examples
<project_id>:export_evaluated_examples\_<model_name>\_<yyyy_MM_dd'T'HH_mm_ss_SSS'Z'>.evaluated_examples
override_existing_table (bool):
If true and an export destination is
specified, then the contents of the destination
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -235,15 +235,15 @@ class AutoTransformation(proto.Message):
class NumericTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The value converted to float32.
- The z_score of the value.
- log(value+1) when the value is greater than or equal to 0.
Otherwise, this transformation is not applied and the value is
considered a missing value.
- z_score of log(value+1) when the value is greater than or equal
to 0. Otherwise, this transformation is not applied and the value
is considered a missing value.
- A boolean value that indicates whether the value is valid.
- The value converted to float32.
- The z_score of the value.
- log(value+1) when the value is greater than or equal to 0.
Otherwise, this transformation is not applied and the value is
considered a missing value.
- z_score of log(value+1) when the value is greater than or equal to
0. Otherwise, this transformation is not applied and the value is
considered a missing value.
- A boolean value that indicates whether the value is valid.

Attributes:
column_name (str):
Expand All @@ -268,13 +268,13 @@ class NumericTransformation(proto.Message):
class CategoricalTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The categorical string as is--no change to case, punctuation,
spelling, tense, and so on.
- Convert the category name to a dictionary lookup index and
generate an embedding for each index.
- Categories that appear less than 5 times in the training dataset
are treated as the "unknown" category. The "unknown" category
gets its own special lookup index and resulting embedding.
- The categorical string as is--no change to case, punctuation,
spelling, tense, and so on.
- Convert the category name to a dictionary lookup index and
generate an embedding for each index.
- Categories that appear less than 5 times in the training dataset
are treated as the "unknown" category. The "unknown" category gets
its own special lookup index and resulting embedding.

Attributes:
column_name (str):
Expand All @@ -289,13 +289,13 @@ class CategoricalTransformation(proto.Message):
class TimestampTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- Apply the transformation functions for Numerical columns.
- Determine the year, month, day,and weekday. Treat each value from
the
- timestamp as a Categorical column.
- Invalid numerical values (for example, values that fall outside
of a typical timestamp range, or are extreme values) receive no
special treatment and are not removed.
- Apply the transformation functions for Numerical columns.
- Determine the year, month, day,and weekday. Treat each value from
the
- timestamp as a Categorical column.
- Invalid numerical values (for example, values that fall outside of
a typical timestamp range, or are extreme values) receive no
special treatment and are not removed.

Attributes:
column_name (str):
Expand All @@ -304,15 +304,15 @@ class TimestampTransformation(proto.Message):
The format in which that time field is expressed. The
time_format must either be one of:

- ``unix-seconds``
- ``unix-milliseconds``
- ``unix-microseconds``
- ``unix-nanoseconds`` (for respectively number of seconds,
milliseconds, microseconds and nanoseconds since start of
the Unix epoch); or be written in ``strftime`` syntax. If
time_format is not set, then the default format is RFC
3339 ``date-time`` format, where ``time-offset`` =
``"Z"`` (e.g. 1985-04-12T23:20:50.52Z)
- ``unix-seconds``
- ``unix-milliseconds``
- ``unix-microseconds``
- ``unix-nanoseconds`` (for respectively number of seconds,
milliseconds, microseconds and nanoseconds since start of
the Unix epoch); or be written in ``strftime`` syntax. If
time_format is not set, then the default format is RFC
3339 ``date-time`` format, where ``time-offset`` = ``"Z"``
(e.g. 1985-04-12T23:20:50.52Z)
invalid_values_allowed (bool):
If invalid values is allowed, the training
pipeline will create a boolean feature that
Expand All @@ -337,15 +337,14 @@ class TimestampTransformation(proto.Message):
class TextTransformation(proto.Message):
r"""Training pipeline will perform following transformation functions.

- The text as is--no change to case, punctuation, spelling, tense,
and so on.
- Tokenize text to words. Convert each words to a dictionary lookup
index and generate an embedding for each index. Combine the
embedding of all elements into a single embedding using the mean.
- Tokenization is based on unicode script boundaries.
- Missing values get their own lookup index and resulting
embedding.
- Stop-words receive no special treatment and are not removed.
- The text as is--no change to case, punctuation, spelling, tense,
and so on.
- Tokenize text to words. Convert each words to a dictionary lookup
index and generate an embedding for each index. Combine the
embedding of all elements into a single embedding using the mean.
- Tokenization is based on unicode script boundaries.
- Missing values get their own lookup index and resulting embedding.
- Stop-words receive no special treatment and are not removed.

Attributes:
column_name (str):
Expand All @@ -361,9 +360,9 @@ class NumericArrayTransformation(proto.Message):
r"""Treats the column as numerical array and performs following
transformation functions.

- All transformations for Numerical types applied to the average of
the all elements.
- The average of empty arrays is treated as zero.
- All transformations for Numerical types applied to the average of
the all elements.
- The average of empty arrays is treated as zero.

Attributes:
column_name (str):
Expand All @@ -389,11 +388,11 @@ class CategoricalArrayTransformation(proto.Message):
r"""Treats the column as categorical array and performs following
transformation functions.

- For each element in the array, convert the category name to a
dictionary lookup index and generate an embedding for each index.
Combine the embedding of all elements into a single embedding
using the mean.
- Empty arrays treated as an embedding of zeroes.
- For each element in the array, convert the category name to a
dictionary lookup index and generate an embedding for each index.
Combine the embedding of all elements into a single embedding
using the mean.
- Empty arrays treated as an embedding of zeroes.

Attributes:
column_name (str):
Expand All @@ -409,10 +408,10 @@ class TextArrayTransformation(proto.Message):
r"""Treats the column as text array and performs following
transformation functions.

- Concatenate all text values in the array into a single text value
using a space (" ") as a delimiter, and then treat the result as
a single text value. Apply the transformations for Text columns.
- Empty arrays treated as an empty text.
- Concatenate all text values in the array into a single text value
using a space (" ") as a delimiter, and then treat the result as a
single text value. Apply the transformations for Text columns.
- Empty arrays treated as an empty text.

Attributes:
column_name (str):
Expand Down
Loading
Loading