A group of scientists from the Faculty of Information and Communication Technology as part of the CLARIN project decided to check whether artificial intelligence is really so versatile. In their experiment, they asked nearly 40,000 questions in 25 categories. They published the results of their observations in a scientific article.
ChatGPT interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup questions, admit its mistakes, challenge incorrect premises, and reject inappropriate requests.
Nobody has done that yet
The more difficult the tasks, the worse Chat GPT performed
According to dr Maciej Kawecki on tysol.pl, the Poles checked how he reacts to sarcasm, whether he understands jokes and is able to capture the broader context of statements. Unfortunately, the more difficult the tasks, the worse Chat GPT performed.
Chat GPT made mistakes that almost anyone would have noticed.