I'm writing a follow-up to my blog post on soft takeoff and DSA, and I am looking for good examples of tech companies or academic research projects that are ~3+ years ahead of their nearest competitors in the technology(ies) they are focusing on.
Exception: I'm not that interested in projects that are pursuing some niche technology, such that no one else wants to compete with them. Also: I'm especially interested in examples that are analogous to AGI in some way, e.g. because they deal with present-day AI or because they have a feedback loop effect.
Even better would be someone with expertise on the area being able to answer the title question directly. Best of all would be some solid statistics on the matter. Thanks in advance!
Seems untrue to me, and I've benchmarked dozens of databases for dozens of problems.
In the column-store space (optimized for aggregate analytics... distributed execution of aggrgated queries, quick filtering based on ordering and data compression) Clickhouse is the best there is in my experience... I made that point 4 years ago, but now you can find plenty of other benchmarks for it. It's used by many large scale search engines and advertisers except google, and among others, by CERN.
In wide column storage space, and more broadly in the "heavy filtering, large amounts of data space" cassandra (, facebook) and now Scylla seem to lead. I've never had to put dozens of petabytes in a database, but the few people that do need this seem to agree.
In the transactional space I haven't seen anyone bring a significant gain over postgres and mariadb yet.
Kv store and in memory caching you have aerospike, rocksdb and stuff that's based on tikv more recently... All slightly different trade-offs, all open source. I'm not even aware of proprietary products here to be honest.
Those 4 combined cover most use cases a db has.
So, not saying I'm convinced I'm correct, but could you provide some examples to back up your claims ? Name some names, or, ideally, provide some uecases/domain where one could find benchmarks that demonstrate a proprietary database gas the upper hand.