38.
Can LLMs hack vulnerable apps? I spent the last week trying to find out!
Can LLMs hack vulnerable apps? I spent the last week trying to find out! I made a fake book review app and gave 15 models the APK with the goal: finding a person's private reviews. GPT 5.5 had the best success rate, DeepSeek V4 Pro solve