Minor cleanup of collector and processor #76
Conversation
Pull Request Overview
Minor refactoring and cleanup to improve structure, avoid duplicate sends, and make Clickhouse writes thread-safe.
- Refactored `TxProcessor` to send to receivers only after inclusion checks, added graceful shutdown flushing
- Replaced standalone `Start` with a `Collector` struct that modularizes API and metrics server startup
- Enhanced `Clickhouse` client with a mutex-protected batch, new save methods, and a `FlushCurrentBatches` API
Reviewed Changes
Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| collector/tx_processor.go | Reordered transaction flow, merged clickhouse handling, added Shutdown, renamed methods/fields |
| collector/collector.go | Introduced Collector type, modularized StartAPIServer/StartMetricsServer, added Shutdown |
| collector/clickhouse.go | Added batchLock for thread safety, refactored batch send logic, implemented FlushCurrentBatches |
| cmd/collect/main.go | Updated collector initialization in main, added default "all" value for allowed sources flag |
| api/server.go | Downgraded subscriber removal log from Info to Debug |
| api/handler.go | Noted potential DoS risk in SSE handler |
Comments suppressed due to low confidence (2)
cmd/collect/main.go:174

- [nitpick] Variable name `collector` shadows the imported package `collector`. Consider renaming the local instance (e.g., `svc` or `col`) to avoid shadowing.

```go
collector := collector.New(collector.CollectorOpts{
```
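For illustration, one such rename (the surrounding options are elided here):

```go
// Hypothetical rename: `col` keeps the `collector` package name usable.
col := collector.New(collector.CollectorOpts{ /* ... */ })
```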
collector/clickhouse.go:92

- The new concurrent batching and flush logic should be covered by unit tests to verify correct behavior under parallel calls and on `FlushCurrentBatches`.

```go
func (ch *Clickhouse) AddTransaction(tx common.TxIn) error {
```
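A minimal test sketch along those lines, assuming `testing`, `sync`, and the repo's `common` package are imported; `newTestClickhouse` is a hypothetical setup helper, and return values of `FlushCurrentBatches` (if any) are elided:

```go
func TestConcurrentBatching(t *testing.T) {
	ch := newTestClickhouse(t) // hypothetical helper: construct a test instance here

	// Hammer AddTransaction from many goroutines to exercise batchLock.
	var wg sync.WaitGroup
	for i := 0; i < 100; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			if err := ch.AddTransaction(common.TxIn{}); err != nil {
				t.Error(err)
			}
		}()
	}
	wg.Wait()

	// After parallel adds, flushing must not race with producers or drop entries.
	ch.FlushCurrentBatches()
}
```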
```go
ch.saveTransactionBatch(ch.currentTxBatch)
ch.saveSourcelogs(ch.currentSourcelogBatch)
ch.batchLock.Unlock()
```
Copilot AI (Jul 4, 2025)
[nitpick] Holding batchLock while synchronously calling saveTransactionBatch and saveSourcelogs may block other producers if retries take long. Consider releasing the lock before dispatching or performing the sends in separate goroutines.
Suggested change:

```diff
-ch.saveTransactionBatch(ch.currentTxBatch)
-ch.saveSourcelogs(ch.currentSourcelogBatch)
-ch.batchLock.Unlock()
+txBatch := ch.currentTxBatch
+sourcelogBatch := ch.currentSourcelogBatch
+ch.batchLock.Unlock()
+go func() {
+	ch.saveTransactionBatch(txBatch)
+}()
+go func() {
+	ch.saveSourcelogs(sourcelogBatch)
+}()
```
```go
receiversAllowedSources: opts.ReceiversAllowedSources,
clickhouseDSN:           opts.ClickhouseDSN,
// ...
func (p *TxProcessor) Shutdown() {
	p.log.Info("Shutting down TxProcessor ...")
```
Copilot AI (Jul 4, 2025)
Shutdown flushes Clickhouse batches but does not stop the transaction receiver loop, which ranges over p.txC and will never exit. Consider closing txC or adding a shutdown signal to stop that goroutine.
Suggested change:

```diff
 p.log.Info("Shutting down TxProcessor ...")
+close(p.done) // signal shutdown
+close(p.txC)  // close the transaction channel
```
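A sketch of the receiver loop that would pair with this, assuming `p.done` is a `chan struct{}` added for shutdown and `processTx` stands in for the existing per-transaction handling (the original loop simply ranges over `p.txC`):

```go
for {
	select {
	case tx, ok := <-p.txC:
		if !ok {
			return // txC was closed by Shutdown; stop the receiver loop
		}
		p.processTx(tx) // hypothetical per-transaction handler
	case <-p.done:
		return // shutdown signaled
	}
}
```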
```go
// Note/TODO: a client with a slow connection may cause blocking other clients and cause DoS on all receivers
fmt.Fprintf(w, "data: %s\n\n", tx)
w.(http.Flusher).Flush() //nolint:forcetypeassert
```
Copilot AI (Jul 4, 2025)
[nitpick] Relying on a single channel and direct writes risks DoS if one subscriber is slow. Consider per-subscriber buffering or non-blocking writes with timeouts to isolate slow clients.
Suggested change:

```diff
-// Note/TODO: a client with a slow connection may cause blocking other clients and cause DoS on all receivers
-fmt.Fprintf(w, "data: %s\n\n", tx)
-w.(http.Flusher).Flush() //nolint:forcetypeassert
+// Perform non-blocking write with timeout to prevent slow clients from causing DoS
+err := writeWithTimeout(w, tx)
+if err != nil {
+	s.log.Warnf("Failed to write to subscriber %s: %v", subscriber.uid, err)
+	s.removeSubscriber(&subscriber)
+	return
+}
```
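Note that `writeWithTimeout` is not defined in this PR. The per-subscriber buffering variant can also be sketched as a buffered channel with a non-blocking send in the broadcaster (`sub` and `txC` are assumed names; `removeSubscriber` and `uid` follow the suggestion above):

```go
// Each subscriber owns a buffered channel and writes/flushes its own SSE
// frames in its handler goroutine; the broadcaster never blocks on one client.
select {
case sub.txC <- tx:
	// delivered to this subscriber's buffer
default:
	// buffer full: drop the slow subscriber instead of stalling all the others
	s.log.Warnf("subscriber %s too slow, removing", sub.uid)
	s.removeSubscriber(sub)
}
```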
📝 Summary