maybe they distilled claude for the flash version and not for the other hence better tool use and programming benchmarks