Skip to content

feat(ntx-builder): blacklist accounts whose actors crash repeatedly#1712

Open
SantiagoPittella wants to merge 2 commits intosantiagopittella-ntx-builder-actor-deactivationfrom
santiagopittella-ntx-builder-account-blacklisting
Open

feat(ntx-builder): blacklist accounts whose actors crash repeatedly#1712
SantiagoPittella wants to merge 2 commits intosantiagopittella-ntx-builder-actor-deactivationfrom
santiagopittella-ntx-builder-account-blacklisting

Conversation

@SantiagoPittella
Copy link
Collaborator

Implements account blacklisting for the NTX Builder (4th task from #1694).

Track crash counts per account in the coordinator. When an actor shuts down due to a DbError, its crash count is incremented. Once it reaches a configurable threshold, the account is blacklisted and spawn_actor skips it.

Only DbError shutdowns count as crashes because other shutdown reasons (Cancelled, IdleTimeout, SemaphoreFailed) are either intentional or system-wide and not indicative of a per-account bug.

tracing::warn!(
%account_id,
crash_count = count,
"Account blacklisted due to repeated crashes, skipping actor spawn"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: should we make this more explicit? tracing::error!, since it signifies a bug in our impl or add "BUG" deliberately to the message?

return;
}
}

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: should we first check for running actors and terminate them?

assert_eq!(inactive_targets[0], inactive_id);
}

// BLACKLIST TESTS
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: we might want to find a different term to "blacklist", i.e. repeated_failure_block_list to avoid any confusion / fud

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants