Chalk’s online dashboard provides a simple way to view
metrics about performance of your feature pipelines.
However, you may wish to export these metrics from Chalk
into other observability tools so that you can view
your Chalk-related data alongside data from other
systems you maintain.
Chalk tracks various time series metrics that measure the latency and throughput of resolvers and
streaming pipelines.
Chalk uses TimescaleDB to store these metrics. You can use any
OpenMetrics-compatible collector to collect metrics about the execution of your feature pipelines
from Chalk. Examples include:
The table below summarizes the metrics that are available for export.
The headers in the table are the exported metric name followed by the
OpenMetrics metric type
(gauge,
histogram,
summary, or
counter).
Metrics
resolver_latency_secondsSummaryProvides information about the time it takes to compute a resolver.
Tags
idStringThe name of the resolver, for example, my.company.get_user
quantile0.5 | 0.75 | 0.95 | 0.99Whether this latency represents the median, 75th percentile, 95th percentile, or 99th percentile of the latency
resolver_typeonline | offline | streamThe type of the resolver - online, offline, or stream.
query_latency_secondsSummaryProvides information about the time it takes to execute an online query.
Tags
idStringThe name of the query, for example, eligbility_query_v2. Queries without names are labeled "Unnamed"
quantile0.5 | 0.75 | 0.95 | 0.99Whether this latency represents the median, 75th percentile, 95th percentile, or 99th percentile of the latency
cron_run_latency_secondsSummaryProvides information about the time it takes to execute a cron run.
Tags
idStringThe name of the resolver executed by the cron run, for example, my.company.get_user
quantile0.5 | 0.75 | 0.95 | 0.99Whether this latency represents the median, 75th percentile, 95th percentile, or 99th percentile of the latency
feature_requestCounterProvides information about the number of times a feature was computed.
Tags
idStringThe name of the feature, for example, user.age
statussuccess | failureThe status of the computed feature (success or failure)
contextinference | cron | migration | streamingThe context in which the feature was generated
resolver_requestCounterProvides information about the number of times a resolver was computed. This metric informs the number of times that resolvers are being called and the context in which they are called, for example in a cron run as part of a scheduled job or in inference as part of a query plan.
Tags
idStringThe name of the resolver, for example, my.company.get_user
statussuccess | failureThe status of the resolver run (success or failure)
contextinference | cron | migration | streamingThe context in which the resolver ran
resolver_typeonline | offline | streamThe type of the resolver - online, offline, or stream.
cron_run_requestCounterProvides information about the number of times a cron run was executed. This metric is useful for monitoring the status of resolver runs that are scheduled or triggered via API to load data into the online and/or offline store.
Tags
idStringThe name of the resolver executed by the cron run, for example, my.company.get_user
statussuccess | failureThe status of the cron run (success or failure)
cron_feature_writesCounterProvides information about the number of features computed by cron written to online / offline store. This metric is useful for monitoring resolver runs that are scheduled or triggered via API to load data into the online and/or offline store.
Tags
idStringThe name of the resolver executed by the cron run, for example, my.company.get_user
contextStringWhether the features were written to online or offline store.
feature_valueSummaryProvides statistical information about the value of features.
Tags
idStringThe name of the feature, for example, user.age
quantile0.5 | 0.75 | 0.95 | 0.99Whether this value represents the median, 75th percentile, 95th percentile, or 99th percentile of the feature value
query_requestCounterProvides information about the number of times an online query was executed.
Tags
idStringThe name of the query, for example, eligibility_query_v2. Queries without names are labeled "Unnamed"
statussuccess | failureThe status of the query (success or failure)
deploymentGaugeThe active deployment version. This gauge will always have a value of
1 for active deployments. A gauge of this kind is sometimes called an
Info metric.
Tags
idStringThe ID of the deployment.
query_http_responseGaugeThe response counts by HTTP response code.
Tags
environmentStringThe ID of the environment.
resolver_high_water_marksGaugeThe current max_ingested_timestamp in UNIX epoch time for resolvers.
Tags
idStringThe ID of the resolver.