Rename `gen_ai.openai.request.response_format` to `gen_ai.request.response_format` #1757
Conversation
Vertex AI/Gemini also supports response schema. I'll take a look and see if it works with this PR
LGTM
docs/gen-ai/openai.md (Outdated)

| `json_object` | JSON object response format | ![Experimental](https://img.shields.io/badge/-experimental-blue) |
| `json_schema` | JSON schema response format | ![Experimental](https://img.shields.io/badge/-experimental-blue) |
I know it wasn't changed in this PR, but what is the difference between these? Neither one is an actual "format"
that's what OpenAI, Azure AI and Cohere do:

- OpenAI / Azure AI Inference
  - `{"type": "json_object"}` - json without schema
  - `{"type": "json_schema", "json_schema": {...}}` - json with schema
- Cohere
  - `{"type": "json_object"}` - json without schema
  - `{"type": "json_object", "json_schema": {...}}` - json with schema
- Vertex AI (correct me if I'm wrong, reading this)
  - `responseMimeType`: `application/json` | `text/plain` | `text/x.enum`
  - `responseSchema`

I.e. if we apply the existing approach to Vertex, it would be (sketched in code below):

- check if `responseMimeType` is json
  - if `responseSchema` is set, then `gen_ai.request.response_format=json_schema`
  - otherwise `gen_ai.request.response_format=json_object`
- if `responseMimeType` is text, `gen_ai.request.response_format=text`
- in other cases, probably `gen_ai.request.response_format={responseMimeType}`

I guess the question is whether we should continue doing this?
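A minimal sketch of that mapping, assuming a plain dict-shaped request; the helper name and structure are hypothetical and not part of this PR or any SDK:

```python
def response_format_attribute(request: dict) -> "str | None":
    """Derive a gen_ai.request.response_format value from a provider request."""
    rf = request.get("response_format")
    if rf:  # OpenAI / Azure AI Inference / Cohere shape
        if rf.get("type") == "json_schema" or "json_schema" in rf:
            return "json_schema"
        return rf.get("type")  # e.g. "json_object", "text"

    mime = request.get("responseMimeType")  # Vertex AI shape
    if mime == "application/json":
        return "json_schema" if "responseSchema" in request else "json_object"
    if mime == "text/plain":
        return "text"
    return mime  # anything else: fall back to the raw mime type, e.g. "text/x.enum"


# e.g. Cohere-style structured output with a schema -> "json_schema"
print(response_format_attribute(
    {"response_format": {"type": "json_object", "json_schema": {"type": "object"}}}
))
```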
I think your reading of the Vertex API is correct. Looking ahead, there is a separate API for generating images which also supports more mime types: https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/imagen-api#output-options. I'm guessing the `gen_ai.request.response_format` attribute would be shared for multimodal as well - what do you think of changing this attribute to a mime type?

Regarding json schema, I opened #1760 to support capturing the schema if available.
> I'm guessing the `gen_ai.request.response_format` attribute would be shared for multimodal as well - what do you think of changing this attribute to a mime type?

great point!

Just did some basic research on types:

- OpenAI
  - Audio response format types: mp3, opus, aac, flac, wav, and pcm.
  - Realtime response format types: pcm16, g711_ulaw, or g711_alaw - this one is called `output_audio_format`
  - Image response format types: url or b64_json
- Vertex AI uses mime types for images too

I think a mime type won't work in some cases (e.g. image URL), and it also requires mapping everything to a mime type, which is not straightforward - e.g. the b64_json that DALL-E produces seems to be webp, but it's not documented (!?) and the internet suggests decoding it and looking at the image header - not something instrumentation should do.

Seeing `output_audio_format` makes me think there is a future where you could specify multiple formats in one request (generate video and audio, etc.) and we might need different attributes to capture `text_format`, `audio_format`, etc.

I think I see two options:

1. Use response_format as a union of all possible formats across modalities (`text | json | mp3 | url | ...`)
   - we can define a few well-known types (e.g. `json` would be used for all json things - `json_object`, `json_schema`, `application/json`, etc.)
   - for everything else it would match the constant used by the specific system - Vertex can use `audio/x-alaw-basic` and OpenAI would do `g711_alaw`; we don't really try to unify them all
2. Break it down into a format per modality
   - `gen_ai.request.text.response.format = plain | json | bson | xml | python`
   - `gen_ai.request.text.response.schema = {schema type name}` - this is only for text and only for structured output
   - `gen_ai.request.image.response_format = url | png | webp | b64_json` - we can do some normalization for a few things and record the rest as is
   - `gen_ai.request.image.response.aspect_ratio = 4:3` - there are other properties that would be worth recording
   - ...

Either way we could/should add something like `gen_ai.request.response.type = text | image | video | audio | ...` - this is the modality.

(all attribute names need polishing, but I hope you get the idea)

I hope we can also make it symmetrical with the request format and the actual response output type.

Let me do a bit more research, but I'm leaning towards option 2 since it provides a typed way to record modality-specific options (see the sketch below).
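For illustration, a minimal sketch of how option 2 could look on a span, using the OpenTelemetry Python API; the attribute names are the placeholders from this comment, not finalized conventions:

```python
# Sketch only: attribute names below are placeholders discussed in this thread.
from opentelemetry import trace

tracer = trace.get_tracer("gen-ai-example")

# Hypothetical image-generation request asking for a base64-encoded result.
with tracer.start_as_current_span("gen_ai.image_generation") as span:
    # modality of the requested output
    span.set_attribute("gen_ai.request.response.type", "image")
    # modality-specific format and other image-specific properties
    span.set_attribute("gen_ai.request.image.response_format", "b64_json")
    span.set_attribute("gen_ai.request.image.response.aspect_ratio", "4:3")
```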
> I hope we can also make it symmetrical with the request format and the actual response output type. Let me do a bit more research, but I'm leaning towards option 2 since it provides a typed way to record modality-specific options

Option 2 sounds good to me on a cursory read-through. It sounds like a bigger follow-up task, so this PR LGTM if you want to address the rest later.
I introduced `gen_ai.output.json.schema.name` and `gen_ai.output.type` following option 2; let's discuss on the GenAI call tomorrow.
Did you push the updated code yet?
apparently not - did it now, thanks!
Co-authored-by: Trent Mick <trentm@gmail.com>
force-pushed from 5e574a5 to 07f27d5
# https://github.com/open-telemetry/semantic-conventions/pull/1757
- rename_attributes:
    attribute_map:
      gen_ai.openai.request.response_format: gen_ai.request.response_format
This is inaccurate now.
@@ -0,0 +1,4 @@
change_type: breaking
component: gen-ai
note: "Rename `gen_ai.openai.request.response_format` to `gen_ai.request.response_format`"
This is inaccurate now. The PR title should change as well.
Structured outputs and response format are used by multiple GenAI systems, so generalizing it.

Also fixing a tiny bug where we have different `id` and `value` for completion token types, plus other minor nits.

See Cohere, Azure AI Inference, and Vertex AI docs.
Merge requirement checklist
[chore]