Get Even More Visitors To Your Blog, Upgrade To A Business Listing >>

AI Videos: Creepy, Boring, Remixed

Last week, OpenAI showed off Sora, a generative video engine that produces high-quality 60-second videos. The public reaction pattern redrew the same open-mouthed Ahs and Ohs around the world that DALL-E and ChatGPT provoked a bit over a year ago. Soon, AI videos will be as obvious, creepy, Boring, and economically worthless as AI images.

Remember when you saw the first DALL-E and ChatGPT-texts? How do you feel about them now? Are you still as impressed by AI images and ChatGPT text as you were a year ago? Do you think they are both boring and uncanny at the same time? Will the same happen to AI videos? Or has it already happened?

Very Recognizable, Very Creepy

Artificial text may not be as easily recognizable as computer-generated visuals. But in the meantime most of us now easily recognize AI images. The effect they have on us is the opposite of Ah and Oh. They’re both creepy and boring at the same time.

Rather than a practical way to easily add a good image to our articles, using AI imagery devalues our texts and makes us look cheap, dull, lazy, and a bit creepy. How come? Maybe it has to do with our knowledge of how they are made:

  • We know that it doesn’t take much time and energy to create them.
  • We know, feel, and recognize the type of errors that they come with.
  • We know that they are not meaningful by themselves, but recycled visual data that we fill with meaning.

We can recognize their faultiness just as well as the fake dinosaurs in the original Jurassic Park movie. At first, AI images seemed fresh and new, and somewhat uncanny. If you took a time machine to travel back thirty years, with a couple of AI images on a floppy disk, your 90’s friends would be stunned. Likely you could sell them for good money. Aphex Twin would be your first customer. You could make a really good deal selling AI-generated videos to Soundgarden.1

Like a 2024 Version of Black Hole Sun

In 1994 the American Rock Group released a surreal, dark song called Black Hole Sun. The lyrics can be interpreted in various ways. Some say they are about nuclear war. Some think Black Hole Sun is about taking heroin. Today, one could argue that it’s about AI images. Their author explains his lyrics as follows: “They’re just words.”

“I wrote it in my head driving home from Bear Creek Studio in Woodinville, a 35–40 minute drive from Seattle. It sparked from something a news anchor said on TV and I heard wrong. I heard ‘blah blah blah black hole sun blah blah blah’. I thought that would make an amazing song title, but what would it sound like?” –Chris Cornell, cit. in Songfacts

Examined from close, an academic might judge them as dark, surreal rock kitsch. The lyrics and their history could be bent into a prophecy on AI-generated content. What fully resonates without much bending are the aesthetics of the video.

The End is nigh: Happy colors, circus freaks celebrating the end of the world.

The diva and the muscle man: AI could have made that one.

Happy Horror: Expensively made with computers. Dead cheap today.

Ha ha ha: Staring at the empty screen.

The video looks just like today’s computer-generated content. It’s oversaturated, deformed, and scary in its lack of humanity and plausibility. It mixes the sweet and the brutal and shows scene after scene without any consideration for story or meaning. According to Howard Greenhalgh, the director of the video, it somewhat mimics the aesthetics of the opening scene of David Lynch’s Blue Velvet. All in all, it aims at coming across as “psychotic”. Greenhalgh called it a “horror cartoon”.

Sora: Aesthetically AI images and Videos often feel like deja-vus. Because they are. Some of them share some striking similarities with some artistic expressions of the nineties and early 2000s. They are oversaturated, distorted, and feel dead inside. Again, because they are.

Black Hole Sun: Soundgarden’s video has been produced to look “utterly psychotic”. At the time they had to reduce the mound of digitally distorted smiles because they were very expensive to create. That the band didn’t want the distorted smiles on themselves made the production economically achievable.

As boring as AI imagery has become, it is still as creepy as it was. One can only speculate why. Is it because we are trained to recognize fakeness, sickness, and death in human appearance? Or is it just not good enough just yet? The reason may have as much to do with the aesthetic as with our knowledge that humans didn’t create these images and videos.

Soundgarden’s video effects were also computer-generated. The difference is that their distortions were made with intention. AI-generated content is as it is because it lacks intention. Computer-generated images lack meaning, they lack emotion, and visually we can recognize that. AI images feel like a bad dream, a bad trip, a bad life, where everything looks real but nothing has any meaning. AI images show us things we somehow know but they feel empty and threaten to drag us in a black hole sun devoid of all sense. The more plausible the look, the scarier their emptiness feels.

Visual AI content will continue to creep us out, no matter how good the simulation becomes. The lack of emotion, intention, meaning, and story, the proximity of the cute and the infernal, the death in AI-generated eyes—all of that is not softened by its increasing verisimilitude. The horror becomes crisper with the growth in pixel density and technical perfection in sound, light, and physics. The crisper they become the clearer the psychosis they portray.

Sora: Cute puppies in slow motion are a good showcase because they slip by our reality filters.

Black Hole Sun: The sweet innocent lamb makes the horror even worse.

Soundgarden’s video has a cute lamb for contrast. Its cuteness makes the rest of the video even more creepy. Sora’s puppies look cute. Slow motion helps hide their fakeness. But in context, they have a similar effect as the lamb in Soundgarden’s famous video, they sweeten the psychotic horror…

Cute puppies are a good showcase because they have no facial expressions. We have a very sensitive recognition apparatus interpreting human motion and faces. It’s so hard to simulate human appearances. Cute puppies in slow motion are the opposite: We just have to like them.

No Feeling, No Meaning

“AI will get better and soon we can’t tell anymore.”

Technology gets better over time. So what? Chess computers have gotten so good at chess that we cannot tell who plays anymore. And yet the difference matters as much as ever. As soon as we know that a computer plays a chess game loses its meaning and becomes a technical exercise. Computers are good for training, but they haven’t made the game more interesting.

Computer-generated movies may get technically perfect. Maybe, one day, the FBI can’t tell the difference between Sora and Stanley Kubrick. But as soon as we’d know, the emptiness of its reality would suck us right back into its black hole. Just like computer chess. Why should I care on this side when no one cares on the other side?

“Everybody Makes Mistakes”

It’s not an issue that “AI” makes mistakes. We also make mistakes. The issue is what types of mistakes computers make. As soon as we notice that a visual mistake is not a sign of humanity but of its absence, something funny happens. Realizing “This is AI” is like finding out that we play chess against a computer. The game loses its purpose.

Three Examples

We make mistakes out of a partial deficiency of understanding or concentration. Processing errors often show their profound lack of understanding of basic reality. In the case of Soras example videos:

The Portuguese Dog

It looks colorful at first, but soon the story makes us worry about the dog. ‘It can’t possibly walk by the shutter without falling,’ we think. Somehow it walks across, easy. Reaching the other window it looks like wants to do another even more impossible jump. Luckily, the video ends before the inevitable fall. A cute dog in a world of beautiful colors turns into a scary story.

Bad idea: Please don’t try that, the shutter is in the way.

Huh? How is this possible? It’s not over though…

What? How did that happen? Please survive…

Oh no, not again! I hope the dog won’t try to…

The Woman in Tokyo

It looks plausible and fashionable at first. But then we notice the zebra lines on the floor, and how they point to the middle of the street. Then we see the chopped-up arrows and made-up Japanese writing on the signs. At this point, her slowly aging face with the dead eye behind the glasses starts giving us a weird vibe. Her constantly advancing age changes the story every three seconds. You may not notice (or have noticed) the aging, but you felt that something was off.

Going to the club: Late teens, early 20ties enjoys a night out out in Shibuya. With sunglasses, so she won’t see the bad zebras and Issey Miyake giants.

Ah, no, it’s a millennial… In her late twenties, or thirties… A foreigner wrote these signs 二んナ七? What does the second arrow mean?

So, she’s GenX? She could be 40, her skin ages right there, right now, but maybe she always was, and we just misinterpreted, because from far away we all look younger…

Okay… What is happening here? How did the clubbing teenager turn into a grandmother so fast?

The Gold Rush Zombie Horror

At some point, you will notice that all the half-horses and horse ghosts appear and disappear. And as you realize that it’s fake and unintentionally so. By the end of the clip, it starts looking like the opening of a Cowboy Zombie movie.

Gold Rush: Those were the days.

What is this? A ghost horse with two legs?

It disappears: Like ghost horses do…

Why? Because it’s a Western horse zombie movie opening.

These little details seem irrelevant in the light of the great technical quality, but they matter for our emotional perception. When horses dissolve the story changes from Gold Rush to The Walking Dead.

The Creepiness Comes From The Lack of Meaning

Computers Lack Understanding

Computers don’t know what they do. They have no comprehension or experience with the world that our languages, visual or verbal describe. They sustain no inner or outer relationship to the signs that they are merely ordered to statistically process. Thoughtlessly reusing computer-generated information you risk being judged accordingly, as someone who either lies or doesn’t know at all what they are saying or doing.

Knowing that information wasn’t meant but generated has the same effect on text, audio, and video as knowing that a chess move wasn’t thought but calculated. The difference in meaning between AI and humans is real. The result of 2*2 found by a pocket calculator or calculated in a human mind may numerically be the same. But it has a different meaning when we know that someone has felt the number 2 becoming the number 4.

We don’t need to see AI’s usual faults—like hands with six fingers, objects that don’t follow the law of physics—anymore to lose interest. We lose interest as soon as we realize that what we read never had any intended meaning. This is our natural reaction to lies and bullshit.

Lies and Other Misdemeanors

The value of content lies in what it means. If it wasn’t meant to express anything it simply doesn’t have the meaning of what you project into it. Unless you are an old-fashioned post-modernist, you know that, no matter how you interpret things, what something was intended to be, influences not just your interpretation… it matters to begin with. Expression without impression undermines meaning. Messages without meaning waste our time. We instinctively turn away from them.

We are trained to spot lies and fakery. Whether it’s a CGI Princess Leia, superman taking off, or the unlikely events in a fairytale, we have a well developed sensory to discern reality from fiction. If it is agreed upon fiction with a story we embrace the fake.

But realizing that what looks real at first is “something else” hooks into a deep fear. A fear of demons, ghosts, and sickness. As soon as we know that what pretends to be emotion is emptiness, trying to manipulate us, we turn away. This is our natural response to lies and trickery. Excluding the phony messenger from our world is a matter of survival.

It’s not just a lack of reality that creeps us out in computer generated images. When we watch a theatre piece or watch superhero movies we trade in the verisimilitude of the visuals for a good story. We forgive the unlikeliness of visuals and action for the sake of the story.

The creepiness doesn’t just come from the horror narrative that AI images and video inadvertently tell us either. If that was the narrative deal we’d happily accept it as entertainment. We watch creepy movies, we read scary stories and we enjoy them. What ultimately creeps us out is that all of this happens without any intended sense or meaning. It seems psychotic. AI content acts like psychotic liars that constantly change their story for no apparent reason.

Negative Value

While the visual horror of computer-generated content persists, the aesthetic value is fading by the second. There are so many AI-generated images out there and they all look more or less the same. The same aesthetic devaluation will hit AI videos. Until it becomes a negative asset.

Technically, crisp, well-made videos are expensive. They take a lot of time. They require a lot of people in the creation process. They cost a lot to produce. They look expensive. The outlook of being able to make technically high-quality videos in just a few seconds is attractive. But foreseeably, the very same inflation that has hit AI images will hit AI video. They will quickly beat the Pet Shop Boys at being boring.2 And then they will start to devalue what is connected to it.

Being Boring with AI: AI images are easily recognizeable. They have quickly become a cheap, empty cliché. Using AI images makes you look about as original, creative, and classy as those 90’s black-and-white images of half-naked people. If you’re a journalist, lawyer, or doctor, you better think twice before picking either for your blog.

Being Boring: Outtakes from the 1990 Pet Shop Boys Video with the same name. A the time these looked cool. In the meantime, we have seen so many half-naked neo-neo classicist supermodels that these images have become very boring.

Computer-generated images are becoming quicker and easier to make. Their quality is a lot better, now; they still do look fun… at first glance; they have started replacing stock images at a fast pace; and, like everything cheap and easy, they are losing their creative and economic value at the same pace as they have become ubiquitous. If they are easily recognizable, they already are a liability.

Compared to traditional video creation, generative video production will cost close to nothing and they will be produced at such an inflationary number that the technical glitz they now seem to offer will grow old within months.

The Can-You-Tell: Not all AI images are directly recognizable. This one is well done. You can still see it in the texture’s details, the strap of the guy’s backpack, and the continuity of the lanyard. With a bit of Photoshop, you could hide those issues, but it might still feel like a computer game on a PS6 somehow.

Take a step back: While there may be some great exceptions, we have quickly developed an ability to sense AI-generated images.

The very same economic inflation that hit computer-generated chess already applies to computer-generated text and computer-generated images. And this is exactly what will happen to computer-generated movies in no time. They will weigh on your credibility.

Original original: The original cover art of *Being Boring* by the Pet Shop Boys. Almost as boring as AI images, but written by humans for humans. Not creepy.

Original Remix: The Maxi single of *Being Boring* was titled *Boring Remixed*, it was not much more interesting than the regular track. But human after all.

Conclusion

Computer-generated videos impress at first sight, but soon they will follow the development of computer-generated text and images and carry close to zero stylistic, economic, or creative value until they become a liability and will cost more than they add, as they drag everything around them down.

You may get lucky and the computer produces an image, text, sound, or video that enriches your message. Or you may get unlucky and the computer produces some sexist, racist, or otherwise offensive, stupid, destructive, degrading message that shows you in an unintended, bad light. The typical aesthetic of AI imagery alone will make everything you do look cheap, careless, and dishonest. Whatever is cheap, fast, and omnipresent, doesn’t add any artistic, aesthetic or economic value. On the contrary.


  1. “While shooting ‘Black Hole Sun,’ he instructed his actors to do one thing: ‘Look basically fucking psychotic[…] The only stipulation the band’s members made was that, unlike the rest of the cast, they didn’t want to be wearing stupid grins. Instead, they played it straight-faced in front of a blue screen. The suggestion saved Greenhalgh some money—all the other actors’ exaggerated smiles needed to be digitally enhanced, and back then even primitive computer-generated effects were expensive. ‘Thank God Chris said he didn’t want to smile in it,’ the director said. ‘Because that would’ve made the postproduction budget massive.'” Source: “Look F**king Psychotic”: The Enduring Mystery of Soundgarden’s “Black Hole Sun” ↩

  2. If you are into pop culture you cannot be seriously offended when you read that the Pet Shop Boys are boring. Ironically, Being Boring was composed in response to Neil Tennant’s feeling hurt after a Japanese reviewer casually wrote that his band was generally considered uninteresting. “My feelings weren’t hurt. After [Pet Shop Boys co-founder] Chris Lowe and I performed at Tokyo’s Budokan arena in early July 1989, a Japanese reviewer wrote, “The Pet Shop Boys are often accused of being boring.” Being Boring – The path to a pop elegy. So he went and wrote a song that they were never being boring. If you still feel offended, you may unironically lick your wounds on The Guardian that will teach you “Why Pet Shop Boys’ Being Boring is the perfect pop song.” It’s still boring though. ↩

The post AI Videos: Creepy, Boring, Remixed appeared first on iA.



This post first appeared on Home | IA, please read the originial post: here

Share the post

AI Videos: Creepy, Boring, Remixed

×

Subscribe to Home | Ia

Get updates delivered right to your inbox!

Thank you for your subscription

×