As an example of what can go wrong when you give LLMs agency over your actions:
Replit ignored explicit instructions and dropped a production DB during a change freeze, leading to the loss of many records. The AI even acknowledged its wrongdoing. And this happened after the developer had spelled things out far more explicitly, following the many 'lies' the GenAI had generated the previous day.