Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pkg: Create generic AttributesStore #276

Merged
merged 1 commit into from
Dec 13, 2024

Conversation

metalmatze
Copy link
Contributor

Rather than creating many implementations of the AttributesStore per type, let's use the new Go generic types.

Starting to use this code as a library I was wondering if this was possible. Turns out it is.

Rather than creating many implementations of the AttributesStore per type, let's use the new Go generic types.

Starting to use this code as a library I was wondering if this was possible. Turns out it is.
@lquerel
Copy link
Contributor

lquerel commented Nov 29, 2024

@metalmatze I had thought about that at one point but I had a bad experience with Go generics in terms of performance. It might have changed (or my memory is bad), but if I remember correctly, Go generics aren't completely zero-cost. Since these attribute stores are on the hot path, I think it would be good to verify the performance impact with the command trace_producer_simu in the tools directory. If the difference is negligible between the current implementation and the generic based implementation, then this simplification would be more than welcome.

@metalmatze
Copy link
Contributor Author

metalmatze commented Nov 29, 2024

I'm having trouble understanding how the trace_producer_simu works.
Using the tools/trace_gen tool:

go run tools/trace_gen/main.go -output generated.json -format json 

Now, there's a generated.json file that can be passed to the tools/trace_producer_simu.

go run ./tools/trace_producer_simu  -batch-size=10,100,500,1000,2500,5000,1000 --max-batches-per-stream=10,100,500,1000,2500,5000,10000 generated.json 

If I read the source code correctly, the first tool generates a file with batchSize 20. We may need many of these files.

Here's the data of my run: https://gist.github.com/metalmatze/70ce17f8818c2fb8579d1c424c3bd6f4

Is trace_producer_simu even relevant? That tool seems to measure compression but not the performance of the code itself.

@lquerel
Copy link
Contributor

lquerel commented Nov 29, 2024

@metalmatze Sorry, reviewing a PR on Thanksgiving evening before going to bed isn't the best idea in the world... On the substance, I still think we need to verify the performance impact, but it wasn't the right tool :-)
Still in the tools directory, there's a metrics_benchmark tool that allows comparing compression rates between OTLP and OTEL Arrow. In your context, the important thing isn't the compression rate (which should remain unchanged) but the timings for each phase in the OTEL Arrow column. There's a data file in the data directory data/multivariate-metrics.pb that you can pass to this tool.

@metalmatze
Copy link
Contributor Author

Happy Thanksgiving! 🦃

Here are the results: https://gist.github.com/metalmatze/f6b635f261c4e1a017f143c4e9232d55
That was way easier and makes a lot more sense to me.

As you said, there are some performance hits, but you need to decide how impactful they are. I'm totally fine closing this PR if you think there's too much overhead.

Copy link
Contributor

@lquerel lquerel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The performance difference seems small enough to justify the gain in maintainability. I think this improvement can be included in the next release.
Adding @jmacd for visibility.

@jmacd
Copy link
Contributor

jmacd commented Dec 13, 2024

Thanks, @metalmatze.

@jmacd jmacd merged commit 5ea9598 into open-telemetry:main Dec 13, 2024
2 checks passed
@metalmatze metalmatze deleted the attributes-store-generic branch December 14, 2024 20:17
@drewrelmas drewrelmas mentioned this pull request Jan 24, 2025
jmacd pushed a commit that referenced this pull request Jan 24, 2025
- Make a generic `AttributesStore` in pkg/arrow.
[#276](#276)
- Update to v0.117.0 collector dependencies.
[#277](#277),
[#279](#279)
- Deprecate local versions of `fileexporter` and `filereceiver` in favor
of upstream components with related README changes.
[#278](#278)
- Made an incremental step towards deprecating `MetricsLevel` usage in
the project.
[#283](#283)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants