And yet… ScyllaDB is famous for being a C++ rewrite of Cassandra (which is written in Java) that is 10x faster.
Your general comment is correct. I see it often with GPU algorithms, which, no surprise, also run much faster on CPUs when compiled with something like ISPC.
That performance improvement could have been obtained by rewriting only the critical paths in C++ and integrating them via JNI, instead of rewriting the world.
It's an approach that tends to be ignored by the "rewrite X in Y" crowd.
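
For anyone who hasn't done it, the JNI route looks roughly like the sketch below. Everything named here (`HotPath`, `checksum`, the `hotpath` library) is made up for illustration and has nothing to do with Cassandra's or ScyllaDB's actual code. It assumes the hot function has been reimplemented in a C++ shared library on `java.library.path`, and falls back to the plain Java version when that library isn't there.

```java
// Hypothetical sketch: class, method, and library names are illustrative only.
public class HotPath {
    // True only if the (assumed) native library could be loaded.
    private static final boolean NATIVE_AVAILABLE = loadNative();

    private static boolean loadNative() {
        try {
            // Assumes a C++ shared library named "hotpath"
            // (libhotpath.so, hotpath.dll, ...) on java.library.path.
            System.loadLibrary("hotpath");
            return true;
        } catch (UnsatisfiedLinkError e) {
            return false; // library missing: stay on the Java path
        }
    }

    // The critical path, implemented on the C++ side. For a class in the
    // default package the exported symbol would be Java_HotPath_checksumNative.
    private static native long checksumNative(byte[] data);

    // The original pure-Java implementation, kept as the fallback.
    static long checksumJava(byte[] data) {
        long sum = 0;
        for (byte b : data) {
            sum += (b & 0xFF); // treat each byte as unsigned
        }
        return sum;
    }

    // Callers use this entry point and never care which side does the work.
    public static long checksum(byte[] data) {
        return NATIVE_AVAILABLE ? checksumNative(data) : checksumJava(data);
    }
}
```

The rest of the codebase stays in Java; only the function behind `checksum` crosses the boundary. Whether that pays off depends on how much work each call does, since every JNI call has its own overhead.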