Yes, you don't get valid results by comparing badly written programs. You only get valid results by comparing well-written programs. How well a program can be written, though, is limited by the language, and Java has myriad ways of encouraging bad writing, so it's hard to find a genuinely well-written Java program. It can be done, but when it's done you still end up with a Java program, so it's hardly worth the trouble.