Gemini is once again breathing down ChatGPT’s neck: Google is launching a new capability in its artificial intelligence engine that allows direct creation of downloadable documents straight from the chat itself. Instead of copying text, reformatting it, or moving between apps, users can now generate a ready-to-use file, ranging from a Word document to a spreadsheet or a presentation.

What’s new?

The new feature, which is available to all Gemini users worldwide, allows the creation of a wide range of formats. Among other things, users can generate PDF files, Word documents in DOCX format, Excel spreadsheets in XLSX format, CSV files, Google Docs, Sheets and Slides, as well as plain text files, Markdown, and RTF. This means users no longer need to perform conversions or make manual adjustments after receiving content.

Google explains that the goal is to shorten the path between an initial idea and a final product. For example, it is now possible to request a budget proposal and receive it as a ready-made Excel file with tables, or to organize ideas into a structured document including bullet points, headings, and sub-sections. It is also possible to generate long summary documents and convert them into a ready-to-distribute PDF file.

This capability is based on deeper integration between Gemini and Google Workspace services, but it is not limited to them. Users can download the file directly to their computer or export it to Google Drive with a single click. This reduces the need for manual work across different systems.

Google (illustration). (credit: SHUTTERSTOCK)

Competition is heating up

The move places Gemini in direct competition with its major rivals, especially ChatGPT. ChatGPT also offers advanced document creation capabilities, but its approach is slightly different: it produces structured, complete content and can sometimes generate downloadable files in various formats through dedicated tools, including Word, Excel, PDF, or presentations.

The main difference lies in the user experience. While Gemini focuses on fast, direct file generation from within the chat as a built-in system feature, ChatGPT’s capabilities often rely on additional tools or on generating files as part of a more interactive process. In many cases, ChatGPT users have broader control over document structure, including advanced customization, table creation, data analysis, and the generation of tailored files on demand.

At the same time, ChatGPT stands out particularly in its ability to work with complex files, process data, analyze existing spreadsheets, and generate new documents from them. It serves not only as a creation tool but also as an advanced work assistant for editing, reviewing, and improving existing documents. On the other hand, Gemini’s advantage at this stage is simplicity and speed: The user receives a ready-made file almost instantly, with minimal steps in between. This is an approach aimed primarily at users who want a fast and simple solution for everyday document creation.