-
Notifications
You must be signed in to change notification settings - Fork 895
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add document defining an OpenTelemetry Collector #4313
base: main
Are you sure you want to change the base?
add document defining an OpenTelemetry Collector #4313
Conversation
Related to open-telemetry#4309 Signed-off-by: Alex Boten <[email protected]>
specification/collector/README.md
Outdated
|
||
- An OpenTelemetry Collector _MUST_ accept a OpenTelemetry Collector Config file. | ||
- An OpenTelemetry Collector _MUST_ be able to be compiled with any and all | ||
additional Collector plugins that the user wishes to include. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this specification define what an OpenTelemetry Collector plugin is? Is it any component of type receiver, processor, exporter, extension, or config map provider?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Specification needs to define all the terms used in this definition, otherwise it does not remove ambiguity.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Renamed plugin to component and added a section to define OpenTelemetry Collector component.
Signed-off-by: Alex Boten <[email protected]>
Signed-off-by: Alex Boten <[email protected]>
Co-authored-by: Reiley Yang <[email protected]>
Signed-off-by: Alex Boten <[email protected]>
Co-authored-by: Reiley Yang <[email protected]>
specification/collector/README.md
Outdated
For a library to be considered an OpenTelemetry Collector component, it _MUST_ | ||
implement the [Component interface](https://github.com/open-telemetry/opentelemetry-collector/blob/main/component/component.go) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Collector also accepts confmap.Provider
s and confmap.Converter
s, which do not accept this interface. Do we consider those out of scope?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think they do need to be considered in scope. Interoperability of those components is important.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Collector also accepts confmap.Providers and confmap.Converters, which do not accept this interface. Do we consider those out of scope?
I wonder if including them would allow us to avoid having to include a definition for a config file, wdyt?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Providers handle configuration abstractly so they help remove the need for how configuration should be represented, but they don't solve the schema part (which I don't feel like we need to solve tbh)
The goal of this document is for users to be able to easily switch between | ||
OpenTelemetry Collector Distros while also ensuring that components produced by | ||
the OpenTelemetry Collector SIG are able to work with any vendor who claims | ||
support for an OpenTelemetry Collector. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure I understand this goal. If a vendor produces a collector distribution that has a subset of available components because those are the components relevant to their service offerings and that they're willing to support, where do any other components (whether hosted in an OTel repo or not) fit into that picture? Do we mean that a distribution must offer end users the ability to modify its source and create their own build? We should be explicit about that if that is the case.
Given that the licensing of the collector's source code does not require that distribution of derivative works happen in source form I'm not sure that we have much ability here to enforce such a requirement. We can certainly try to use the "OpenTelemetry" mark as a cudgel, but I'm not sure it'll be as effective as may be desirable since the terms "collector" and "distribution" are very broad. It could perhaps be argued that "OpenTelemetry Collector" is a protectable mark and maybe even that "Collector" has acquired secondary meaning in this limited scope, but protecting such a mark against genericization is going to be a Sisyphean task.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see this definition as separation from the term Distribution
defined below. A Distribution is a specific compiled OpenTelemetry Collector with a specific set of OpenTelemetry Collector Components that the maintainer (the user in this case) decided to add. It is a OpenTelemetry Collector bc the maintainer was able to bring their chosen OpenTelemetry Collector components to it.
Something is not an OpenTelemetry Collector if it cannot support OpenTelemetry Collector Components. Maybe the word additional
below is unnecessary and could be removed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given that the licensing of the collector's source code does not require that distribution of derivative works happen in source form I'm not sure that we have much ability here to enforce such a requirement
We potentially have leverage over:
- Trademark usage if "OpenTelemetry Collector" becomes a trademark
- What we list on our registry and website and what we promote
- What wording can be used in 'official' OTel events
I think we have enough leverage here to make this worth it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure that having a registered "OpenTelemetry Collector" mark is sufficient here as nominative fair use would allow anyone preparing a distribution (in the colloquial sense, not Distribution
however we seek to define it) to identify it as such. The Linux Foundation trademark usage guidelines also call out specifically this sort of usage as acceptable for indicating products are related to or based on the project that produces the product bearing their marks.
Obviously the project can control what it puts on its website and what marketing collateral is used in conjunction with events operated by LF/CNCF, but that doesn't seem like effective leverage over an actor who has no need or interest in such things.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would generally say if someone isn't interested in "playing nice" then it doesn't really matter what we say or what we don't say. The solution to enforceable marks is offering certification and conformance suites that are attached to actual trademarks (e.g., "OTLP Inside" or whatever). This document is guidance for the community as much as it is guidance for external parties.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What @austinlparker said along with my thinking that its important to define this, and include this requirement, to make clear why opentelemetry.io would or wouldn't list project Y as a Collector or Distribution.
specification/collector/README.md
Outdated
For a library to be considered an OpenTelemetry Collector component, it _MUST_ | ||
implement the [Component interface](https://github.com/open-telemetry/opentelemetry-collector/blob/main/component/component.go) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think they do need to be considered in scope. Interoperability of those components is important.
Co-authored-by: Anthony Mirabella <[email protected]>
to: collector/README.md | ||
---> | ||
|
||
# OpenTelemetry Collector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OpenTelemetry Collector is never defined. Is it a source code artifact? A binary?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's one of the things I was trying to get at here. Since there's no binary plugin mechanism it seems that the source would need to be available for it to be extended in the manner contemplated, but that's not clear or explicit in the current state.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since there's no binary plugin mechanism it seems that the source would need to be available for it to be extended in the manner contemplated, but that's not clear or explicit in the current state.
Is the lack of binary plugin mechanism something that the OpenTelemetry Collector SIG wants to solve? Are there technical blockers?
Binary and dynamic loading plugin seem to be an established pattern. For example:
-
https://docs.fluentbit.io/manual/administration/configuring-fluent-bit/yaml/plugins-section
While Fluent Bit comes with a variety of built-in plugins, it also supports loading external plugins at runtime. This feature is especially useful for loading Go or Wasm plugins that are built as shared object files (.so).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Fluent Bit example is not necessarily apposite as it involves a C application dynamically loading shared libraries built with the Go toolchain using CGO (which is generally prohibited in the Collector codebase).
Go does have a native plugin mechanism, though it comes with many caveats and is widely regarded as a bad idea that can't be dropped due to compatibility guarantees. Its documentation sums up its litany of restrictions in this way, which sounds a lot like a suggestion to use something like ocb
:
Together, these restrictions mean that, in practice, the application and its plugins must all be built together by a single person or component of a system. In that case, it may be simpler for that person or component to generate Go source files that blank-import the desired set of plugins and then compile a static executable in the usual way.
Signed-off-by: Alex Boten <[email protected]>
Signed-off-by: Alex Boten <[email protected]>
The goal of this document is for users to be able to easily switch between | ||
OpenTelemetry Collector Distros while also ensuring that components produced by | ||
the OpenTelemetry Collector SIG are able to work with any vendor who claims | ||
support for an OpenTelemetry Collector. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given that the licensing of the collector's source code does not require that distribution of derivative works happen in source form I'm not sure that we have much ability here to enforce such a requirement
We potentially have leverage over:
- Trademark usage if "OpenTelemetry Collector" becomes a trademark
- What we list on our registry and website and what we promote
- What wording can be used in 'official' OTel events
I think we have enough leverage here to make this worth it
Co-authored-by: Anthony Mirabella <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
two minor suggestions, otherwise this looks great! Thank you Alex!
Co-authored-by: Jacob Aronoff <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you Alex 🙇
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Alex, this definition really captures the heart of it. A+
Signed-off-by: Alex Boten <[email protected]>
Overall LGTM, but I suggest there's a (minor) clarification around this comment:
As the bottom part mentions that, for vendor distros, this is a SHOULD instead. |
Signed-off-by: Alex Boten <[email protected]>
Updated should to must, PTAL |
of an OpenTelemetry Collector with a specific set of components and features. A | ||
Distribution author _MUST_ provide users with tools and/or documentation for adding | ||
their own components to the Distribution's components. Note that the resulting |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems problematic to me as a MUST
. This is, in effect, a requirement that distributions be made available in source form with a license that permits modification (and presumably distribution, though that's not clear). The license under which the Collector is released does not require this and I'm suspicious of the ability to use trademark protections to prevent someone from using the phrasing "Foo Distribution for OpenTelemetry Collector" given that's literally the first "Correct" example in the Linux Foundation trademark guidelines and is a textbook case of nominative fair use.
I think if we want an identifier for compatible distributions that can be effectively controlled we will need a distinctive mark for a compatibility certification that can be granted to distributions that satisfy its requirements, similar to what @tedsuo seems to be describing here.
Beyond those concerns, this requirement also seems excessively vague. What qualifies as "tools and/or documentation"? Is a link to https://go.dev/dl/ sufficient? This probably requires a definition similar to "Corresponding Source" from AGPL-3, which again reinforces the limitations that come from not having this be part of the license under which the collector source code is made available.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The spirit here is to allow users to reuse their components when they move from one distribution to another: the engineering investments made should not be lost. If the distribution is open source and there's clear documentation how to add a new component to it, that's good enough for me. If the distribution is not open source but allows me to enter the Go module name on a web interface somewhere and get a binary out, that's also fine.
I'd see that binary as "tainted" (to use the kernel terminology) and the final binary might not be officially supported (with SLAs) by a service provider, but as an end-user, I'm not locked in.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a specification, so I don't think it's appropriate to leave ambiguity here and rely on interpretation of the "spirit" of the requirement. This was changed from SHOULD
to MUST
in response to a comment seeking clarification that we intended to require distributions to allow users to modify their source and build new, modified, binaries.
If the distribution is not open source but allows me to enter the Go module name on a web interface somewhere and get a binary out, that's also fine.
I would not expect, and do not think it reasonable to expect, that any vendor offering a closed-source, binary-only distribution will allow users to provide arbitrary code to be built into a new "tainted" binary by that vendor. Doing so would allow for a user to cause a vendor to distribute binaries built from code licensed under terms the vendor has no opportunity to review and which may require, for instance, that any code it is compiled with be distributed under the same terms.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a specification, so I don't think it's appropriate to leave ambiguity here and rely on interpretation of the "spirit" of the requirement
I agree, my comment was more to provide the background, hoping that it would trigger ideas for a new wording.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is defining what OpenTelemetry considers a distribution. To be considered a distribution by the project I'd think there is free reign over restrictions. That is different from trademark which would mean the project could actually stop someone else from saying, "This is an Otel Collector Distribution". So this wouldn't offer that protection, but instead define for others what the project will itself consider and call a distribution.
I'd still support rephrasing this to not requiring docs/tooling if it works with Otel docs and tooling. Which may mean "requiring" the documentation of all components within a distribution (otherwise how else would a user define an equivalent ocb configuration).
The goal of this document is for users to be able to easily switch between | ||
OpenTelemetry Collector Distros while also ensuring that components produced by | ||
the OpenTelemetry Collector SIG are able to work with any vendor who claims | ||
support for an OpenTelemetry Collector. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure that having a registered "OpenTelemetry Collector" mark is sufficient here as nominative fair use would allow anyone preparing a distribution (in the colloquial sense, not Distribution
however we seek to define it) to identify it as such. The Linux Foundation trademark usage guidelines also call out specifically this sort of usage as acceptable for indicating products are related to or based on the project that produces the product bearing their marks.
Obviously the project can control what it puts on its website and what marketing collateral is used in conjunction with events operated by LF/CNCF, but that doesn't seem like effective leverage over an actor who has no need or interest in such things.
to: collector/README.md | ||
---> | ||
|
||
# OpenTelemetry Collector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Fluent Bit example is not necessarily apposite as it involves a C application dynamically loading shared libraries built with the Go toolchain using CGO (which is generally prohibited in the Collector codebase).
Go does have a native plugin mechanism, though it comes with many caveats and is widely regarded as a bad idea that can't be dropped due to compatibility guarantees. Its documentation sums up its litany of restrictions in this way, which sounds a lot like a suggestion to use something like ocb
:
Together, these restrictions mean that, in practice, the application and its plugins must all be built together by a single person or component of a system. In that case, it may be simpler for that person or component to generate Go source files that blank-import the desired set of plugins and then compile a static executable in the usual way.
This PR was marked stale due to lack of activity. It will be closed in 7 days. |
[Collector components](#opentelemetry-collector-components) that | ||
the user wishes to include. | ||
|
||
## OpenTelemetry Collector configuration file |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this section really required when it is already defined above that it must accept an OpenTelemetry Collector configuration?
Besides redundancy it makes this document a living document that would have to remember to be updated if a new top level key is ever added to the collector configuration file -- for the "minimum structure".
implement a [Component interface](https://pkg.go.dev/go.opentelemetry.io/collector/component#Component) | ||
defined by the OpenTelemetry Collector SIG. | ||
|
||
Components require a unique identfier as a `type` string to be included in an OpenTelemetry |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similar concern as above when you could just say it must implement the Component interface. And is this paragraph referring to the type ID
in Component
which says:
The component ID (combination type + name) is unique for a given component.Kind.
So multiple components can use the same identifier if they are of different Kind
s?
Changes
Adds a definition of an OpenTelemetry Collector