wait_for_schema_agreement() too slow?

Resuming a discussion between @nyh and @Lorak-mmk from https://github.com/scylladb/scylladb/pull/23138 on why schema agreement may be slow (for the sake of discussion, let's also assume a large multi-AZ, multi-DC cluster of, say, 100 nodes):
1. There's a 0.2-second sleep between cycles of trying to reach an agreed-upon schema - https://github.com/scylladb/python-driver/blob/a4014093192a28f86a7fed789722e7f2b901f2a1/cassandra/cluster.py#L4268 - is that a real issue?
2. https://github.com/scylladb/python-driver/blob/a4014093192a28f86a7fed789722e7f2b901f2a1/cassandra/cluster.py#L4275 may be overkill - creating and populating a dictionary from all peers, just to see if there's a mismatch? Not sure it's a real issue, but it clearly could be done better.
3. Is the query that _get_peers_query() uses (see https://github.com/scylladb/python-driver/blob/a4014093192a28f86a7fed789722e7f2b901f2a1/cassandra/cluster.py#L4299) optimal for this use case? What do we need other than schema_version? There seem to be 3 options:
```
_SELECT_PEERS = "SELECT * FROM system.peers"
_SELECT_PEERS_NO_TOKENS_TEMPLATE = "SELECT host_id, peer, data_center, rack, rpc_address, {nt_col_name}, release_version, schema_version FROM system.peers"
_SELECT_SCHEMA_PEERS_TEMPLATE = "SELECT peer, host_id, {nt_col_name}, schema_version FROM system.peers"
```

I think (just from reading the code) we use _SELECT_SCHEMA_PEERS_TEMPLATE, which seems reasonable.
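To make points 1 and 2 concrete, here is a minimal sketch of what a cheaper check could look like. It is not the driver's code: `schemas_agree`, `wait_for_agreement`, and the `fetch_versions` callable are hypothetical names, and skipping peers with an unknown (`None`) schema version is an assumption made only for this example. The check stops at the first mismatch instead of building a dictionary, and the polling interval is a parameter, so the effect of the 0.2-second sleep could be measured by varying it.

```python
import time


def schemas_agree(local_version, peer_versions):
    """Return True if every peer with a known version matches the local one.

    peer_versions is any iterable of schema_version values. A value of None
    (version unknown) is skipped; that is an assumption of this sketch.
    all() short-circuits, so iteration stops at the first mismatch.
    """
    return all(v == local_version for v in peer_versions if v is not None)


def wait_for_agreement(fetch_versions, max_wait=10.0, interval=0.2,
                       clock=time.monotonic, sleep=time.sleep):
    """Poll until all schema versions match or max_wait seconds elapse.

    fetch_versions is a hypothetical callable returning a tuple
    (local_version, iterable_of_peer_versions).
    """
    deadline = clock() + max_wait
    while True:
        local, peers = fetch_versions()
        if schemas_agree(local, peers):
            return True
        # Give up if another sleep would take us past the deadline.
        if clock() + interval > deadline:
            return False
        sleep(interval)
```

In this sketch, once the cluster converges, detection is delayed by at most one `interval` plus the time `fetch_versions` takes, so on a large cluster the cost of the queries themselves is likely worth measuring alongside the sleep.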

Anything else? Do we even have an issue?
    

wait_for_schema_agreement() too slow? #453
