-
Notifications
You must be signed in to change notification settings - Fork 161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(instrumentation): add fw metrics #187
base: main
Are you sure you want to change the base?
Conversation
89064aa
to
5de3232
Compare
const metricExporter = new OTLPMetricExporter({ | ||
url: "https://bee-collector.apps.fmaas-backend.fmaas.res.ibm.com/v1/metrics", | ||
}); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We will likely need some better domain that is stable. @matoushavlena
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Discussed with @GALLLASMILAN. He is going to lead the next actions required for this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will switch the URL to https://bee-telemetry.res.ibm.com/v1/metrics when it's prepared.
package.json
Outdated
"@opentelemetry/exporter-metrics-otlp-http": "^0.54.2", | ||
"@opentelemetry/resources": "^1.27.0", | ||
"@opentelemetry/sdk-metrics": "^1.27.0", | ||
"@opentelemetry/sdk-node": "^0.54.2", | ||
"@opentelemetry/semantic-conventions": "^1.27.0", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do you need to specify the metrics deps directly? Following should work
import { metrics } from "@opentelemetry/sdk-node"
README.md
Outdated
@@ -152,6 +152,10 @@ This project and everyone participating in it are governed by the [Code of Condu | |||
|
|||
All content in these repositories including code has been provided by IBM under the associated open source software license and IBM is under no obligation to provide enhancements, updates, or support. IBM developers produced this code as an open source project (not as an IBM product), and IBM makes no assertions as to the level of quality nor security, and will not be maintaining this code going forward. | |||
|
|||
## Telemetry | |||
|
|||
Some metrics are collected by default. See the [Native Telemetry](./docs/native-telemetry.md) for more detail. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use absolute path + run yarn docs:build
afterward.
package.json
Outdated
"test:unit": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest run src", | ||
"test:unit:watch": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest run src", | ||
"test:e2e": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest run tests/e2e", | ||
"test:e2e:watch": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest watch tests/e2e", | ||
"test:all": "yarn test:unit && yarn test:e2e && yarn test:examples", | ||
"test:examples": "vitest --config ./examples/vitest.examples.config.ts run tests/examples", | ||
"test:examples:watch": "vitest --config ./examples/vitest.examples.config.ts watch tests/examples", | ||
"test:examples": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest --config ./examples/vitest.examples.config.ts run tests/examples", | ||
"test:examples:watch": "BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED=true vitest --config ./examples/vitest.examples.config.ts watch tests/examples", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would add such variable to .env.test
(just create it, it is going to be automatically used: see tests/setup.ts
)
src/instrumentation/config.ts
Outdated
@@ -18,6 +18,9 @@ import { parseEnv } from "@/internals/env.js"; | |||
import { z } from "zod"; | |||
|
|||
export const INSTRUMENTATION_ENABLED = parseEnv.asBoolean("BEE_FRAMEWORK_INSTRUMENTATION_ENABLED"); | |||
export const INSTRUMENTATION_METRICS_ENABLED = !parseEnv.asBoolean( | |||
"BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we not switch the flag to _ENABLED
and set the default to true
?
// send metrics to the public collector | ||
emitter.match( | ||
(event) => event.path === `${basePath}.run.${finishEventName}`, | ||
async () => await metricReader.forceFlush(), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TIP: metricReader.forceFlush.bind(metricReader)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here, this inside forceFlush
will always correctly refer to metricReader
, so bind() is not required.
5de3232
to
cbfaa9b
Compare
docs/native-telemetry.md
Outdated
|
||
We understand that not all users want to send telemetry data. You can easily disable this feature by setting an environment variable: | ||
|
||
```bash |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
isn't there support for ```env ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I haven't seen the ```env yet, I will remove the bash
part and the markdown will highlight it correctly.
Signed-off-by: GALLLASMILAN <[email protected]>
Signed-off-by: GALLLASMILAN <[email protected]>
c8e45fb
to
af722fe
Compare
Signed-off-by: GALLLASMILAN <[email protected]>
af722fe
to
969a12a
Compare
package.json
Outdated
@@ -277,6 +279,7 @@ | |||
"@googleapis/customsearch": "^3.2.0", | |||
"@grpc/grpc-js": "^1.12.4", | |||
"@grpc/proto-loader": "^0.7.13", | |||
"@ibm-generative-ai/node-sdk": "~3.2.3", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BAM is no longer supported
const name = "bee-agent-framework"; | ||
|
||
export const activeTracesMap = new Map<string, string>(); | ||
export const meter = api.metrics.getMeter(name, Version); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd assume name and version are taken from the OTEL SDK instance. Is that not the case?
src/instrumentation/opentelemetry.ts
Outdated
moduleUsageGauge.record(1, { | ||
source: instance.constructor.name, | ||
type: instance instanceof BaseAgent ? "agent" : instance instanceof Tool ? "tool" : "llm", | ||
framework_version: Version, | ||
os_type: os.type(), | ||
os_release: os.release(), | ||
os_arch: os.arch(), | ||
trace_id: traceId, | ||
event_id: eventId, | ||
}); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is weird to see most of these labels,
os, fw version, trace are likely supplied by the SDK (not necessarily as a label but as OTEL resource or something like that)
trace_id and event_id are not a good fit for a metric
source and type are good but it signals this should be an incrementing counter not a gauge, it will also likely blow up our collector/backend.
Lastly, and this is likely the most important. I'd say we'd want a label that establishes an anonymous identity once framework is cloned (or first used). A random ID that's stable.
docs/integrations.md
Outdated
const answer = response.messages.at(-1)?.content; | ||
return { next: Workflow.END, update: { answer: answer?.toString() } }; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the main
branch this is not needed.
examples/integrations/langgraph.ts
Outdated
const answer = response.messages.at(-1)?.content; | ||
return { next: Workflow.END, update: { answer: answer?.toString() } }; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
same
throw new Error("Method not implemented."); | ||
} | ||
public readonly emitter = Emitter.root.child<BaseLLMEvents>({ | ||
namespace: ["bam", "llm"], |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
update namespace please
src/instrumentation/opentelemetry.ts
Outdated
@@ -116,3 +127,26 @@ export function buildTraceTree({ | |||
}, | |||
); | |||
} | |||
|
|||
export const isMeasurementedInstance = (instance: any) => |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
use named function instead
src/instrumentation/types.ts
Outdated
@@ -14,22 +14,22 @@ | |||
* limitations under the License. | |||
*/ | |||
|
|||
import { Attributes, SpanStatus, TimeInput } from "@opentelemetry/api"; | |||
import { api } from "@opentelemetry/sdk-node"; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
import type { api } ...
@@ -20,6 +20,9 @@ import * as R from "remeda"; | |||
import { extractClassName } from "@/serializer/utils.js"; | |||
import { SerializerError } from "@/serializer/error.js"; | |||
import { Cache } from "@/cache/decoratorCache.js"; | |||
import { startMetricNodeSdkReader } from "@/instrumentation/sdk.js"; | |||
|
|||
startMetricNodeSdkReader(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why the start function is here in serializable
module?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Each exported measurable module inherits from this class.
Signed-off-by: Milan Gallas <[email protected]>
Signed-off-by: Milan Gallas <[email protected]>
Description
I added the new custom middleware, that collects
module_usage
statistics with data about os, version etc. This middleware is enabled by default and sends data about the (agent, llm, tool) entities.Data are automatically sent via the open telemetry SDK that is part of the framework right now, but it's configured only for metrics. I use PeriodicMetric reader, but I need to push the metrics when each run ends. Otherwise, I would drop a lot of metrics because the runtime does not wait for the SDK pushing.
This default telemetry can be disabled via the
BEE_FRAMEWORK_INSTRUMENTATION_METRICS_DISABLED
environment variable. See the new./docs/docs/native-telemetry.md
file for more info.Checklist
yarn lint
oryarn lint:fix
yarn format
oryarn format:fix
yarn test:unit
yarn test:e2e