You sadly can't glean any informative information out of this experiment(other than bot z is better than bot y), as the hardware both engines are running on is not consistent.
If you want an answer to this question, you should use cutechess -cli. Along with running on the same hardware (and default settings!) this would allow you to generate a larger sample size of games.