“If you
go to Bing and you search for a bird, you get a bird picture. But here, the
pictures are created by the computer, pixel by pixel, from scratch,” explained
Xiaodong He, a principal researcher and research manager in the Deep Learning
Technology Center at Microsoft’s research lab in Redmond, Washington, in
Microsoft’s announcement. “These birds may not exist in the real world — they
are just an aspect of our computer’s imagination of birds.”
The bot
is able to generate a variety of images, researchers say, including everything
from “ordinary pastoral scenes,” like those with grazing livestock, to the
absurd – like “a floating double-decker
bus.”
Microsoft
says the bot was trained on datasets of paired images and captions, allowing it
to understand how to match up words to images. It learned to draw a bird, for
example, when the caption says “bird,” and it learned what a picture of a bird
should look like.
The
drawing bot’s technology consists of two machine learning models – one to
generate images from text descriptions, and another that uses text descriptions
to judge the authenticity of the generated images. The former tries to get fake
pictures past the latter, but the latter doesn’t want to be fooled. Combined,
the two work together to create higher-quality images.
What’s
especially interesting is how the bot can fill in the blanks when specific
details aren’t mentioned – basically, it has a little common sense and imagination
of its own, thanks to its training data. In the bird example, the bot will
usually draw a bird sitting on a tree branch even if that’s not stated in the
text, because the images originally fed to it often showed something similar.
Facebook,
too, has worked on teaching neural networks to make small images of things like
airplanes, cars and animals, and even creating your own Bitmoji-like avatar
from a photo.
Nvidia
researchers created computer-generated celebrities, using A.I.
It
suggests the bot could act as a sketch assistant for painters or interior
designers, or could be implemented as a tool for voice-activated photo
refinement.
Further
down the road, He says that, with more computing power, the technology could
possibly generate animated films from screenplays to reduce the manual labor
required today from animators.
But the
technology isn’t there yet.
taken from external source..
No comments:
Post a Comment