#27 Google ๐Ÿ†š OpenAI

These weeks so many things happened it got me overwhelmed ๐Ÿ˜ฐ. Felt like it's hard to stay on top of the AI trend. When things just get bad, breathe and worry not. Then you can see the clear picture.

Let's dive in.

The highlights:

  1. OpenAI Chat GPT-4oโ€‹
    A new model capable of understanding voice, seeing things and flirting with you, with the voice of Scarlett Johanson. Note, the voice interaction is super fast, and it is literally a real time interaction. Best of all! It's FREE!
  2. Google Gemini IOโ€‹
    Veo - Sora concurrent (that is still just a demo, hidden behind a waitlist)
    Imagen 3 - A very interesting image builder, that inspires me to change image generator for Texti app.
    2 Million Tokens Privately for Gemini
    Ask Photos - Ask AI to find you a story about your pictures stored in google cloud.
    โ€‹Project Astra - the Star of the show! Multi-modal highly interactive demo, but this yet again just a demo ๐Ÿ˜…

โ€‹GPT-4o (omni)โ€‹

So OpenAI in their usual fashion have made everybody look like beginners. Google now looks like it's just playing around, and they really have no idea what they are doing.

One of the most fantastic things that OpenAI did, was they were just having fun with the AI model, and this means the product is enjoyable.

I truly believe that when a product is enjoyable, it's actually ready for people to use.

Live Translations - Spanish to English. The video does it all. It's fabulous and the speed of translation is acceptable. You could in theory rely on GPT-4o to get on a live chat with someone you don't speak the language.

twitter profile avatar
OpenAI
Twitter Logo
Twitter Logo
@OpenAI
9:39 PM โ€ข May 13, 2024
760
Retweets
4770
Likes
โ€‹

Be my eyes - this is probably my favourite one. You can now rely on your phone as a guide for blind people. Happy to hear that tech is used for people with disabilities!

Blindness is undoubtedly an impediment in life, yet I've seen people excel in life regardless of it. When tech can help someone overcome a problem, it makes me think we live in a world where people aren't just selfish.

twitter profile avatar
OpenAI
Twitter Logo
Twitter Logo
@OpenAI
9:39 PM โ€ข May 13, 2024
776
Retweets
4589
Likes
โ€‹

Multi-modal example. Let's keep it straight, this is cheats level god. You are now not protected by anything, you can just show a sheet of paper to gpt-4o and it will know how to read it and how to solve it properly. Purely โ€“ fantastic ๐Ÿช„

Also if you ever had issues to learn or understand something because there were no tutor around, then now you don't need to be afraid there's not going to be any possibilities. You can ask as many dumb questions as you want, no more shame in dumb questions. A robot does not need to care about feelings, it's all just pure information. The only necessary thing is a will to learn ๐Ÿ‘จโ€๐ŸŽ“

twitter profile avatar
OpenAI
Twitter Logo
Twitter Logo
@OpenAI
9:39 PM โ€ข May 13, 2024
679
Retweets
4369
Likes
โ€‹

There are many more examples of using GPT-4o in this thread on twitter, feel free to check them out!

An important thing to note, is that the voice of the assistant resembles heavily the voice of Scarlett Johanson, and she's suing them for that ๐Ÿ˜‚. But that's a story for another day.

Anyhow, the speed of the interaction, the fact that you can now speak over to cancel and re-direct the assistant into the right place is freaking fantastic. It feels pretty close to a human that you can chat with.


โ€‹Google I/O "24โ€‹

To begin with, we need to make sure what google is doing. Google is literally throwing mud to a wall and sees what sticks. It launches more projects that it can think of, and see what project reaches a desired interesting number for them. If it doesn't go well, it's just going to be thrown in the bin. The google graveyard is yet to see a bunch of new projects.

Veo - this as presented by google a mashup of Video Poet, WALT, Lumiere, Phenaki. It is supposed to learn from all, convert it into super powers and bring into light a product you'd love. However, this is all same as before. Just a demo, with no proof. They say they asked creators to use it, but we don't know what really lies behind it. For now all we can do is just say we're waiting for access to check it out on ourselves.

โ€‹Imagen 3 - google's answer to midjourney, but I mean we all know it's not even close. The UI/UX is great though, we should definitely take notes from this and try running some improvements for ourselves. Below you can see a prompt that normally would run on a model and see the output. The UX of the tags at the bottom really makes a difference, it does inspire you to try out different styles and contexts of the image generation.

The results though are fine, not exceptional or extremely creative by any means, but the prompt is as simple as it gets as well. Also the a very important note is that fingers are drawn correctly so that's also a decent output.

Feel free to try it out by yourself here: https://aitestkitchen.withgoogle.com/tools/image-fxโ€‹

One of the highlights of the I/O was and still is Project Astra, this is as everybody says these days ground breaking, wow, new and impressive. A multi-modal capable interface-enabled AI that would be able to help you with answering questions you don't have an answer to.

The core principle being as simple as. You look at it, ask about it, and it'll give you the answer straight on point. Check their demo bellow.

It's is very important, this is simply a demo, it's not necessary a product that is ready to be launched. It could be all be just a recorded and edited video just for the sake of AI hype to keep the investors happy. Google is known to run such "experiments" ๐Ÿ˜…

โ€‹

Concluding today's newsletter, I just want to remind you. Our computers can now see, percept our voice, and respond in a fun and interactive way. If this is ain't the future I don't know what is. Enjoy the moment! โค๏ธ


That's it folks, see you next week โค๏ธ๏ธ๏ธ๏ธ๏ธ๏ธ๏ธ

Happy Prompting!

โ€‹

Remember to invite your friends to subscribe at https://โ€‹newsletter.texti.appโ€‹

{{ address }}
โ€‹Unsubscribe ยท Preferencesโ€‹

ยฉ 2024 Texti Newsletter