Texti Newsletter #16

Today's highlights include something I was not expecting to happen this soon! It is really incredible, and I was really hyped when Google announced it, but they lag behind their own product managers.

The highlights of the week:

  1. Next Generation Text2Image AI is finally here!
  2. Elon Musk Sues OpenAI 🤯🤦
  3. Make it sing!

Fast SDXL AI

TLDR:

This is gonna blow your mind 🤯: it generates images as you type!!!

Long Story:

I don't think I can type a better story than the TLDR. I mean, it freaking generates the image on the fly! Yes, it isn't perfect by any means, but holy, it's done on the FLY!!!
Just 3 weeks ago I mentioned and showed Google's video demo about this, but now it is a hands-on experience: you go to https://fastsdxl.ai/ and start typing.
Feel free to check out the following video. It is not sped up; it's real time.

If you're wondering why this impresses me so much, there's a simple reason: it eases the learning curve dramatically. Instead of waiting a minute or so for an image to show up, you just start typing and voilà, the image is there!

Damn, this is impressive, I just can't get over it! Please try it out! https://fastsdxl.ai/


Elon Musk is suing OpenAI

TLDR:

Elon is not happy that OpenAI is not as "open" as its name suggests. And I think Elon is correct here.

Long Story:

I didn't know this, but apparently Sam and Elon are good friends. You see, back in 2015 the company was founded by Sam Altman, Greg Brockman, Reid Hoffman, Jessica Livingston, Peter Thiel, and Elon Musk. These key people agreed that AI technology must be open source and free for everybody to access. It also had to enable the entire tech community to contribute and improve it.
However, everything changed in 2019/2020 when they realized they were sitting on a gold mine 🏆.

Elon Musk has had his own activities since then, so he wasn't really part of the gang actively developing it, but he was donating money to OpenAI to maintain its activity and development. A grand total of $44 million was donated by Elon.

Since Microsoft became OpenAI's major investor and partner, OpenAI stopped being open and became more of a closed-source company that just shares an interface to use. This obviously makes Elon and the founding fathers a little unhappy, because the mission they once set has drifted dramatically in the opposite direction.

I don't always say this, but I think Elon is correct here, because what was once an honorable mission is now a thing of the past. Open source will eventually produce more advancements, and faster, than any closed-source company. Granted, it is a little more dangerous, but everything has 2 sides and a large gray area in the middle. An open source community means no barriers to entry, so anyone can tinker with it and deliver an "accidentally" fantastic output. I truly believe a company like this must be open source.

No matter how many employees you have, a community of 8 billion people is still larger and doesn't need as many managers to build the future.


AI Lip Sync

TLDR:

A new research paper was just published, in which an impressive combination of Image2Image algorithms is put together into a model that can turn an image into a video where the face in the picture lip-syncs to the music in the background.

Long Story:

This week on Twitter/X I stumbled across this incredible model that's capable of doing Image2Video and then Video2LipSync, based on an audio track. Holy next level of deep fakes. I mean, this is sooo cool!

If you access the research page, you can see an in-depth explanation of how this is done. But in a nutshell, what they do is:

  1. Take an input image.
  2. Generate a bunch of variations of that image, with possible face gestures.
  3. Convert the audio to a timeline of words and music, so that you can match the face gestures.
  4. Align the face gestures with the audio timeline.
  5. Mash them together.
  6. Output a video that blows you out of the water.
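
To make steps 3 and 4 a bit more concrete, here is a tiny Python sketch of aligning an audio timeline with video frames. This is only my toy illustration, not the researchers' actual method: the `Sound` class, the `frames_for_timeline` function, and the mouth-shape labels are all made-up names, and a real model works with learned face motion rather than a handful of labels.

```python
from dataclasses import dataclass


@dataclass
class Sound:
    """One entry of the (hypothetical) audio timeline."""
    start: float  # seconds, inclusive
    end: float    # seconds, exclusive
    mouth: str    # made-up mouth-shape label for this sound


def frames_for_timeline(timeline, duration, fps=25, rest="closed"):
    """Pick one mouth shape per video frame from the audio timeline.

    Frames that fall between sounds get the `rest` shape.
    """
    n_frames = round(duration * fps)
    frames = []
    for i in range(n_frames):
        t = i / fps  # timestamp of frame i in seconds
        shape = rest
        for sound in timeline:
            if sound.start <= t < sound.end:
                shape = sound.mouth
                break
        frames.append(shape)
    return frames


# Toy timeline: three sounds with a pause in between.
timeline = [
    Sound(0.0, 0.2, "open"),
    Sound(0.2, 0.4, "wide"),
    Sound(0.6, 0.8, "round"),
]
frames = frames_for_timeline(timeline, duration=1.0, fps=5)
print(frames)
```

Each frame simply looks up which sound is playing at its timestamp, which is the "matching" idea in a nutshell. Everything hard (generating the face variations and blending them smoothly) is left out here.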

I honestly believe there's a ton of potential in this. It is very similar to last week's Gen-Z math tutor, which was incredibly cool.
Some ideas that come to mind immediately:

  1. Missed Connections: Imagine you want to talk to someone from the past who has sadly passed away. You can just add a voice-over and the AI will take care of the rest; it'll make your relative seem almost alive.
  2. AI is sound: You've heard of fictional characters before; now they will actually be able to communicate. Imagine a Twitch stream with a completely fictional character, where an AI plays a game and humans just watch it.
  3. Fictional Impresario: Completely fictional Hollywood actors could now become a reality, and the stunt doubles used until now could become a thing of the past, so nobody gets hurt during a movie scene.
  4. Security Risks ⚠️: This is the other side of this power: you can impersonate sooo many people. I think we'll eventually need code words instead of "hello" to make sure the person on the other end is actually that person, and not an AI or somebody impersonating them.

Oh the world of the future is so different! It's so exciting!


That's it folks, see you next week ❤️

Happy Prompting!

Remember to invite your friends to subscribe at https://newsletter.texti.app

© 2024 Texti Newsletter