Split connection management away from exporter (#1369)
* Split protocol handling away from exporter
This commits adds a ProtocolDriver interface, which the exporter
will use to connect to the collector and send both metrics and traces
to it. That way, the Exporter type is free from dealing with any
connection/protocol details, as this business is taken over by the
implementations of the ProtocolDriver interface.
The gRPC code from the exporter is moved into the implementation of
ProtocolDriver. Currently it only maintains a single connection,
just as the Exporter used to do.
With the split, most of the Exporter options became actually gRPC
connection manager's options. Currently the only option that remained
to be Exporter's is about setting the export kind selector.
* Update changelog
* Increase the test coverage of GRPC driver
* Do not close a channel with multiple senders
The disconnected channel can be used for sending by multiple
goroutines (for example, by metric controller and span processor), so
this channel should not be closed at all. Dropping this line closes a
race between closing a channel and sending to it.
* Simplify new connection handler
The callbacks never return an error, so drop the return type from it.
* Access clients under a lock
The client may change as a result on reconnection in background, so
guard against a racy access.
* Simplify the GRPC driver a bit
The config type was exported earlier to have a consistent way of
configuring the driver, when also the multiple connection driver would
appear. Since we are not going to add a multiple connection driver,
pass the options directly to the driver constructor. Also shorten the
name of the constructor to `NewGRPCDriver`.
* Merge common gRPC code back into the driver
The common code was supposed to be shared between single connection
driver and multiple connection driver, but since the latter won't be
happening, it makes no sense to keep the not-so-common code in a
separate file. Also drop some abstraction too.
* Rename the file with gRPC driver implementation
* Update changelog
* Sleep for a second to trigger the timeout
Sometimes CI has it's better moments, so it's blazing fast and manages
to finish shutting the exporter down within the 1 microsecond timeout.
* Increase the timeout for shutting down the exporter
One millisecond is quite short, and I was getting failures locally or
in CI:
go test ./... + race in ./exporters/otlp
2020/12/14 18:27:54 rpc error: code = Canceled desc = context canceled
2020/12/14 18:27:54 context deadline exceeded
--- FAIL: TestNewExporter_withMultipleAttributeTypes (0.37s)
otlp_integration_test.go:541: resource span count: got 0, want 1
FAIL
FAIL go.opentelemetry.io/otel/exporters/otlp 5.278s
or
go test ./... + coverage in ./exporters/otlp
2020/12/14 17:41:16 rpc error: code = Canceled desc = context canceled
2020/12/14 17:41:16 exporter disconnected
--- FAIL: TestNewExporter_endToEnd (1.53s)
--- FAIL: TestNewExporter_endToEnd/WithCompressor (0.41s)
otlp_integration_test.go:246: span counts: got 3, want 4
2020/12/14 17:41:18 context canceled
FAIL
coverage: 35.3% of statements in ./...
FAIL go.opentelemetry.io/otel/exporters/otlp 4.753s
* Shut down the providers in end to end test
This is to make sure that all batched spans are actually flushed
before closing the exporter.
2020-12-21 22:49:45 +02:00
|
|
|
// Copyright The OpenTelemetry Authors
|
|
|
|
//
|
|
|
|
// Licensed under the Apache License, Version 2.0 (the "License");
|
|
|
|
// you may not use this file except in compliance with the License.
|
|
|
|
// You may obtain a copy of the License at
|
|
|
|
//
|
|
|
|
// http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
//
|
|
|
|
// Unless required by applicable law or agreed to in writing, software
|
|
|
|
// distributed under the License is distributed on an "AS IS" BASIS,
|
|
|
|
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
|
|
// See the License for the specific language governing permissions and
|
|
|
|
// limitations under the License.
|
|
|
|
|
|
|
|
package otlp // import "go.opentelemetry.io/otel/exporters/otlp"
|
|
|
|
|
|
|
|
import (
|
|
|
|
"context"
|
2020-12-22 19:21:45 +02:00
|
|
|
"sync"
|
Split connection management away from exporter (#1369)
* Split protocol handling away from exporter
This commits adds a ProtocolDriver interface, which the exporter
will use to connect to the collector and send both metrics and traces
to it. That way, the Exporter type is free from dealing with any
connection/protocol details, as this business is taken over by the
implementations of the ProtocolDriver interface.
The gRPC code from the exporter is moved into the implementation of
ProtocolDriver. Currently it only maintains a single connection,
just as the Exporter used to do.
With the split, most of the Exporter options became actually gRPC
connection manager's options. Currently the only option that remained
to be Exporter's is about setting the export kind selector.
* Update changelog
* Increase the test coverage of GRPC driver
* Do not close a channel with multiple senders
The disconnected channel can be used for sending by multiple
goroutines (for example, by metric controller and span processor), so
this channel should not be closed at all. Dropping this line closes a
race between closing a channel and sending to it.
* Simplify new connection handler
The callbacks never return an error, so drop the return type from it.
* Access clients under a lock
The client may change as a result on reconnection in background, so
guard against a racy access.
* Simplify the GRPC driver a bit
The config type was exported earlier to have a consistent way of
configuring the driver, when also the multiple connection driver would
appear. Since we are not going to add a multiple connection driver,
pass the options directly to the driver constructor. Also shorten the
name of the constructor to `NewGRPCDriver`.
* Merge common gRPC code back into the driver
The common code was supposed to be shared between single connection
driver and multiple connection driver, but since the latter won't be
happening, it makes no sense to keep the not-so-common code in a
separate file. Also drop some abstraction too.
* Rename the file with gRPC driver implementation
* Update changelog
* Sleep for a second to trigger the timeout
Sometimes CI has it's better moments, so it's blazing fast and manages
to finish shutting the exporter down within the 1 microsecond timeout.
* Increase the timeout for shutting down the exporter
One millisecond is quite short, and I was getting failures locally or
in CI:
go test ./... + race in ./exporters/otlp
2020/12/14 18:27:54 rpc error: code = Canceled desc = context canceled
2020/12/14 18:27:54 context deadline exceeded
--- FAIL: TestNewExporter_withMultipleAttributeTypes (0.37s)
otlp_integration_test.go:541: resource span count: got 0, want 1
FAIL
FAIL go.opentelemetry.io/otel/exporters/otlp 5.278s
or
go test ./... + coverage in ./exporters/otlp
2020/12/14 17:41:16 rpc error: code = Canceled desc = context canceled
2020/12/14 17:41:16 exporter disconnected
--- FAIL: TestNewExporter_endToEnd (1.53s)
--- FAIL: TestNewExporter_endToEnd/WithCompressor (0.41s)
otlp_integration_test.go:246: span counts: got 3, want 4
2020/12/14 17:41:18 context canceled
FAIL
coverage: 35.3% of statements in ./...
FAIL go.opentelemetry.io/otel/exporters/otlp 4.753s
* Shut down the providers in end to end test
This is to make sure that all batched spans are actually flushed
before closing the exporter.
2020-12-21 22:49:45 +02:00
|
|
|
|
|
|
|
metricsdk "go.opentelemetry.io/otel/sdk/export/metric"
|
|
|
|
tracesdk "go.opentelemetry.io/otel/sdk/export/trace"
|
|
|
|
)
|
|
|
|
|
|
|
|
// ProtocolDriver is an interface used by OTLP exporter. It's
|
|
|
|
// responsible for connecting to and disconnecting from the collector,
|
|
|
|
// and for transforming traces and metrics into wire format and
|
|
|
|
// transmitting them to the collector.
|
|
|
|
type ProtocolDriver interface {
|
|
|
|
// Start should establish connection(s) to endpoint(s). It is
|
|
|
|
// called just once by the exporter, so the implementation
|
|
|
|
// does not need to worry about idempotence and locking.
|
|
|
|
Start(ctx context.Context) error
|
|
|
|
// Stop should close the connections. The function is called
|
|
|
|
// only once by the exporter, so the implementation does not
|
|
|
|
// need to worry about idempotence, but it may be called
|
|
|
|
// concurrently with ExportMetrics or ExportTraces, so proper
|
|
|
|
// locking is required. The function serves as a
|
|
|
|
// synchronization point - after the function returns, the
|
|
|
|
// process of closing connections is assumed to be finished.
|
|
|
|
Stop(ctx context.Context) error
|
|
|
|
// ExportMetrics should transform the passed metrics to the
|
|
|
|
// wire format and send it to the collector. May be called
|
|
|
|
// concurrently with ExportTraces, so the manager needs to
|
|
|
|
// take this into account by doing proper locking.
|
|
|
|
ExportMetrics(ctx context.Context, cps metricsdk.CheckpointSet, selector metricsdk.ExportKindSelector) error
|
|
|
|
// ExportTraces should transform the passed traces to the wire
|
|
|
|
// format and send it to the collector. May be called
|
|
|
|
// concurrently with ExportMetrics, so the manager needs to
|
|
|
|
// take this into account by doing proper locking.
|
|
|
|
ExportTraces(ctx context.Context, ss []*tracesdk.SpanSnapshot) error
|
|
|
|
}
|
2020-12-22 19:21:45 +02:00
|
|
|
|
|
|
|
// SplitConfig is used to configure a split driver.
|
|
|
|
type SplitConfig struct {
|
|
|
|
// ForMetrics driver will be used for sending metrics to the
|
|
|
|
// collector.
|
|
|
|
ForMetrics ProtocolDriver
|
|
|
|
// ForTraces driver will be used for sending spans to the
|
|
|
|
// collector.
|
|
|
|
ForTraces ProtocolDriver
|
|
|
|
}
|
|
|
|
|
|
|
|
type splitDriver struct {
|
|
|
|
metric ProtocolDriver
|
|
|
|
trace ProtocolDriver
|
|
|
|
}
|
|
|
|
|
|
|
|
var _ ProtocolDriver = (*splitDriver)(nil)
|
|
|
|
|
|
|
|
// NewSplitDriver creates a protocol driver which contains two other
|
|
|
|
// protocol drivers and will forward traces to one of them and metrics
|
|
|
|
// to another.
|
|
|
|
func NewSplitDriver(cfg SplitConfig) ProtocolDriver {
|
|
|
|
return &splitDriver{
|
|
|
|
metric: cfg.ForMetrics,
|
|
|
|
trace: cfg.ForTraces,
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
// Start implements ProtocolDriver. It starts both drivers at the same
|
|
|
|
// time.
|
|
|
|
func (d *splitDriver) Start(ctx context.Context) error {
|
|
|
|
wg := sync.WaitGroup{}
|
|
|
|
wg.Add(2)
|
|
|
|
var (
|
|
|
|
metricErr error
|
|
|
|
traceErr error
|
|
|
|
)
|
|
|
|
go func() {
|
|
|
|
defer wg.Done()
|
|
|
|
metricErr = d.metric.Start(ctx)
|
|
|
|
}()
|
|
|
|
go func() {
|
|
|
|
defer wg.Done()
|
|
|
|
traceErr = d.trace.Start(ctx)
|
|
|
|
}()
|
|
|
|
wg.Wait()
|
|
|
|
if metricErr != nil {
|
|
|
|
return metricErr
|
|
|
|
}
|
|
|
|
if traceErr != nil {
|
|
|
|
return traceErr
|
|
|
|
}
|
|
|
|
return nil
|
|
|
|
}
|
|
|
|
|
|
|
|
// Stop implements ProtocolDriver. It stops both drivers at the same
|
|
|
|
// time.
|
|
|
|
func (d *splitDriver) Stop(ctx context.Context) error {
|
|
|
|
wg := sync.WaitGroup{}
|
|
|
|
wg.Add(2)
|
|
|
|
var (
|
|
|
|
metricErr error
|
|
|
|
traceErr error
|
|
|
|
)
|
|
|
|
go func() {
|
|
|
|
defer wg.Done()
|
|
|
|
metricErr = d.metric.Stop(ctx)
|
|
|
|
}()
|
|
|
|
go func() {
|
|
|
|
defer wg.Done()
|
|
|
|
traceErr = d.trace.Stop(ctx)
|
|
|
|
}()
|
|
|
|
wg.Wait()
|
|
|
|
if metricErr != nil {
|
|
|
|
return metricErr
|
|
|
|
}
|
|
|
|
if traceErr != nil {
|
|
|
|
return traceErr
|
|
|
|
}
|
|
|
|
return nil
|
|
|
|
}
|
|
|
|
|
|
|
|
// ExportMetrics implements ProtocolDriver. It forwards the call to
|
|
|
|
// the driver used for sending metrics.
|
|
|
|
func (d *splitDriver) ExportMetrics(ctx context.Context, cps metricsdk.CheckpointSet, selector metricsdk.ExportKindSelector) error {
|
|
|
|
return d.metric.ExportMetrics(ctx, cps, selector)
|
|
|
|
}
|
|
|
|
|
|
|
|
// ExportTraces implements ProtocolDriver. It forwards the call to the
|
|
|
|
// driver used for sending spans.
|
|
|
|
func (d *splitDriver) ExportTraces(ctx context.Context, ss []*tracesdk.SpanSnapshot) error {
|
|
|
|
return d.trace.ExportTraces(ctx, ss)
|
|
|
|
}
|