Tag: openai
-
What I learned from ChatGPT-40’s launch event

ChatGPT-4o is here. The “o” stands for omni, hinting at the combination of text, audio, video and image outputs that the latest version of ChatGPT offers. OpenAI CEO Sam Altman posted on X that GPT-4o is “natively multimodal” with its native combination of voice, text and vision. At the launch livestream of GPT-4o earlier this…
-
ElevenLabs (Product Review)

My summary of Eleven Labs before using it – ElevenLabs turns text into voice. How does ElevenLabs explain itself in the first minute? On its website ElevenLabs explains that its users can create natural AI voices instantly in any language. How does ElevenLabs work? I select a language – English – and sample text is…
-
Learning about video and generative AI (2)

Last week I wrote about the arrival of Sora and this week I’ll cover two new contributions to the field of video and AI: Meta’s “V-JEPA” method and Alibaba’s “EMO” model. EMO With EMO (Emote Portrait Alive – why wasn’t it called “EPO”?!), users can create singing or talking videos, based off a static image…
-
Learning about video and generative AI (1)

A couple of weeks ago, we saw and heard about two interesting breakthroughs in the world of AI generated content: Sora and V-JEPA. In this post I want to touch on SORA and its promise (and potential dangers) for the field of content production. Sora OpenAI has now introduced Sora, an AI model that generates…
