Matteo's Notes

15 November 2025

TIL Nano Banana, the AI image generation model, is not a diffusion model but an autoregressive one, unlike previous generations of Imagen, DALL-E 2 and 3, Midjourney, and Stable Diffusion.

Of note, gpt-image-1, the technical name of the underlying image generation model, is an autoregressive model. While most image generation models are diffusion-based to reduce the amount of compute needed to train and generate from such models, gpt-image-1 works by generating tokens in the same way that ChatGPT generates the next token, then decoding them into an image. It’s extremely slow at about 30 seconds to generate each image at the highest quality (the default in ChatGPT), but it’s hard for most people to argue with free.

In August 2025, a new mysterious text-to-image model appeared on LMArena: a model code-named “nano-banana”. This model was eventually publicly released by Google as Gemini 2.5 Flash Image, an image generation model that works natively with their Gemini 2.5 Flash model. Unlike Imagen 4, it is indeed autoregressive, generating 1,290 tokens per image. After Nano Banana’s popularity pushed the Gemini app to the top of the mobile App Stores, Google eventually made Nano Banana the colloquial name for the model as it’s definitely more catchy than “Gemini 2.5 Flash Image”.
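To make the "generate tokens one at a time, then decode them into an image" idea from the quotes concrete, here is a toy sketch. It is not how gpt-image-1 or Nano Banana are implemented: the vocabulary size, the 4x4 grid, and the `next_token`, `generate_tokens`, and `decode` helpers are all assumptions made up for illustration, and a real system would use a trained transformer and a learned decoder instead of random sampling.

```python
import random

VOCAB_SIZE = 16        # toy codebook size (assumption)
TOKENS_PER_IMAGE = 16  # toy 4x4 grid; the quote cites 1,290 tokens for Nano Banana


def next_token(prefix, rng):
    """Hypothetical stand-in for a trained model: pick the next token given the prefix."""
    # Half of the time repeat the previous token, so the dependence on the
    # already-generated prefix is visible; otherwise sample uniformly.
    if prefix and rng.random() < 0.5:
        return prefix[-1]
    return rng.randrange(VOCAB_SIZE)


def generate_tokens(n, seed=0):
    """Autoregressive loop: each new token is conditioned on all previous ones."""
    rng = random.Random(seed)
    tokens = []
    for _ in range(n):
        tokens.append(next_token(tokens, rng))
    return tokens


def decode(tokens, width):
    """Toy decoder: map each token to a gray level and arrange the levels in rows."""
    levels = [t * 255 // (VOCAB_SIZE - 1) for t in tokens]
    return [levels[i:i + width] for i in range(0, len(levels), width)]


tokens = generate_tokens(TOKENS_PER_IMAGE)
image = decode(tokens, width=4)
for row in image:
    print(row)
```

The contrast with diffusion is in the loop: a diffusion model refines the whole image over several denoising steps, while this loop commits to one token at a time, which fits the quote's remark about how slow generation is.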

#154 /

20:57

/ #ai #google #openai

Inside Cursor

When people describe someone in a professional setting as “young,” I usually find this translates to either “somewhat incompetent” or “good at their job but gratingly unprofessional.” Knowing the former was not going to be an issue at Cursor, I was prepared for at least some of the latter.

Despite a young average age, I was pleasantly surprised to find the team instead to be warm, well-dressed, keen on eye contact, clear and respectful in communication, and assiduous about replacing empty toilet paper rolls on the dispenser of the shared bathrooms. I was also surprised to find people so young so often communicate their ideas by reference to Silicon Valley history, world history, pop culture, art, learnings from seemingly unrelated industries, and patterns they’ve observed in the work of others they’ve long admired. The range of references is wide, but what’s clear in every example is that people at Cursor study the world as they move through it, rather than rely exclusively on their own personal experience for all their context and idea-generation (a typical pitfall of “young” people). It makes the team particularly good at finding elegant solutions to many shapes of problems.

To share what they’re observing and learning, many team members create “brain” channels in Slack where they publish their personal musings; there’s no expectation of a response or engagement, but people with good ideas can command quite a following. For the most popular brain channels, the content has little to do with “proof of work” or “managing up,” but rather ideas and reflections. Recent examples include musings on whether “CMSes are an artifact of the pre-AI era,” a deeply considered readout from a slew of customer visits, and a very exacting friction log on a still-nascent Cursor product.

Perhaps most importantly to me, you won’t see much LFGGGGGG, talk of being “cracked,” or overuse of emojis or memes. Recent favorite non-work related messages include an invitation to Vivaldi’s The Four Seasons at the SF Symphony, a picture from respective NY and SF 9pm run clubs, friendly mockery at a bad take on AI in The New Yorker, an entire channel dedicated to #laundry featuring a weekly “laundry standup” slackbot, debates about how to fold fitted sheets, and a poll about which humanoid robot will first make our beds. No one ever breaks character. By far, the most used reaction emoji is ♥️. No one raises their voices, gets angsty or flustered, or visibly panics when things go sideways. It all feels very…adult.

Brie Wolfson in Inside Cursor - Sixty days with the AI coding decacorn

#153 /

17:27

IT Wallet is almost ready in the IO app, a further step beyond Documenti su IO, as I had written.

(from various PRs in the GitHub repository)

#152 /

16:35

/ #digitalizzazione #italia #pagopa

TIL with a .git-blame-ignore-revs file you can exclude specific commits from the "blame" output on GitHub (or with git blame --ignore-revs-file .git-blame-ignore-revs), useful when you refactor or change the formatting but not the functionality.

(source, example)
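A minimal sketch of what the ignore file looks like, assuming a SHA-1 repository: Git expects one full, unabbreviated commit hash per line, and lines starting with `#` are comments. The `render_ignore_revs` helper and the placeholder hashes below are hypothetical, only there to show the format; real hashes would come from `git log`.

```python
import re

# Full (unabbreviated) SHA-1 commit hash: 40 lowercase hex characters.
FULL_SHA1 = re.compile(r"[0-9a-f]{40}")


def render_ignore_revs(entries):
    """Build the text of a .git-blame-ignore-revs file.

    entries: iterable of (commit_hash, description) pairs.
    Each description becomes a '#' comment line above its hash.
    """
    lines = []
    for commit_hash, description in entries:
        if not FULL_SHA1.fullmatch(commit_hash):
            raise ValueError(f"expected a full 40-character hash, got {commit_hash!r}")
        lines.append(f"# {description}")
        lines.append(commit_hash)
    return "".join(line + "\n" for line in lines)


# Placeholder hashes for illustration only.
example = render_ignore_revs([
    ("a" * 40, "Reformat the codebase (no functional change)"),
    ("b" * 40, "Rename modules (no functional change)"),
])
print(example, end="")
```

Saved as .git-blame-ignore-revs in the repository root, that text is what the GitHub blame view picks up. Locally, running `git config blame.ignoreRevsFile .git-blame-ignore-revs` should make plain `git blame` use it without passing the flag every time.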

#151 /

16:16

/ #dev #git

#150 /