Tag: images

  • AI Generative Image Overview

    AI Generative Image Overview

    From Text to Pixels: How Generative AI Creates Images

    When you use generative AI to make an image, you’re working with a system that has been trained to recognize and rebuild visual patterns — not just to draw, but to recreate structure from noise.

    In text generation, the AI predicts the next word in a sequence based on patterns it has learned. For images, the data is pixels — millions of color values that form shapes, textures, and lighting patterns. The model learns from billions of training images, each converted into numbers that describe how pixels relate to each other. Over time, it builds statistical “maps” of what things like trees, faces, or clouds tend to look like.

    The Diffusion Process: Learning to Remove Noise

    During training, the model is given an image and taught to add random noise until the image becomes pure static. Then it learns the reverse process — how to take that noisy image and gradually remove the noise to recover the original picture.

    By repeating this millions of times, the model learns a general rule:

    Starting with random noise, here’s how to remove the noise in a way that reveals something that looks like the images the model has been trained on.

    Human Input: Beyond the Data

    It’s important to understand that creating these models isn’t just about the billions of images. People are essential at every step.  Initially, vast datasets of images are gathered.  Human labelers often categorize these images and write descriptive captions – detailed text descriptions of what’s in the image (e.g., “a golden retriever playing fetch in a park”). These captions become crucial for connecting the visual content with language.  Furthermore, AI trainers are employed to fine-tune the models, evaluating their output and adjusting the training process.  Even seemingly simple tasks like verifying that images are not duplicates or that they are safe for public display require human oversight.  Finally, platforms like reCAPTCHA (or similar systems) often utilize human interaction to help distinguish real images from automatically generated ones, improving data quality.

    When you generate a new image, the AI starts with noise and applies that learned denoising process — guided by your text prompt. Each step removes a bit more noise, revealing colors and shapes that match your description. It doesn’t “copy” any one training image; instead, it uses what it has learned about visual structure to create a new combination. However, without the images it was trained on, the model could not be made.

    X-Y plot of algorithmically-generated AI art of European-style castle in Japan demonstrating DDIM diffusion steps
    “X-Y plot of algorithmically-generated AI art of European-style castle in Japan demonstrating DDIM diffusion steps” by Benlisquare is licensed under CC BY-SA 4.0.

    Prompting and Iteration

    Your prompt gives the AI a direction — it turns words into a kind of “map” that influences what it reveals during denoising.  The quality of the prompt directly impacts the quality of the image.

    Ethical Considerations: Thinking Critically About AI Image Generation

    AI image generation is a powerful technology with significant ethical implications. To use AI ethically, it’s crucial to understand these implications, both in how the AI is trained and in how it’s used.

    1. Bias in Training Data: AI learns from the data it’s fed. The massive datasets used to train image generation models are compiled from the internet, and the internet reflects existing societal biases. This means AI can also use and even amplify harmful stereotypes related to gender, race, age, ability, religion and more. For example, a prompt including “CEO” might disproportionately generate images of men in suits, reinforcing a biased perception of leadership. The humans involved in curating and labeling these datasets, as well as those who provide feedback to train the AI to avoid certain outputs (a process called human-in-the-loop reinforcement learning), also bring their own biases into the loop. Even seemingly neutral captions can subtly reinforce stereotypes.

    2. Copyright and Ownership: The images used to train these models are often copyrighted. While AI doesn’t “copy” images directly, there’s ongoing debate about whether the generated images infringe on the copyrights of the original artists.  The legal landscape is still changing, and it’s important to be aware of the potential copyright implications of using AI-generated images.  Think about how a prompt referencing a specific artist’s style raises these issues.

    While there are potential copyright issues with how models are trained, at the current time, images generated by AI re not copyrightable. This means that if you’re doing work that you or a client wants to copyright, you should not use AI generated images.

    3. Misuse and Potential Harm: AI image generation can be misused to create deepfakes, spread misinformation, or generate harmful content. It’s essential to consider the potential impact of your creations and to use this technology responsibly. 

    For example on the social media platform X, the Grok AI allowed people to edit photos other users had posted including allowing people other than the original poster to change photos so people wore revealing clothing. This caused public controversy including bans in certain countries and investigations by the State of California and the United Kingdom.

    Nnon-consensual, sexually explicit material is never appropriate and there are many other types of images that could be inappropriate including generating images that could be used to impersonate someone or to create false narratives. Think about the implications of the images you create with generative AI.

    4. Transparency and Accountability: It’s important to be transparent about the fact that an image was AI-generated. This helps to avoid misleading viewers and promotes accountability.  Consider adding a disclaimer when sharing AI-generated images, especially if they could be misinterpreted as real.

    Related Posts

  • Where to find free images

    Where to find free images

    When you use an image on a web site, you must obey copyright law and only use images you have an explicit right to use. So, searching Google Images is NOT the way to find them (unless you take additional steps shown below).

    What You Can Use

    The general rule of thumb is:

    Only use images where a specific license has been given to the image allowing you to use it.

    If you can’t find a license on the site where you see the image, don’t use the image.

    Here are some places and ways you can find images that you can use (often without paying money):

    Creative Commons

    This is actually a set of licenses, most of which are designed to allow you to use media (they can apply to other media) but that also come with restrictions like:

    • Attribution: you must credit the original author.
    • Share-alike: If you modify the image to create your own work (derivative) then you also have to share your creation with a CC license.
    • Non-commercial: you can’t make money off your use of the image.

    Creative Commons Search

    Stock Sites

    Most stock sites charge you money for the right to use their photos. when working on a professional production with a budget this is a great place to go and can get you relatively cheap images (less than paying for a photoshoot) that are high quality and fairly compensate the photographers.

    Sometimes there is not a budget, like on your class projects. These are sites with stock photography where either all or some are licensed to allow for use for free.

    Each of these sites have their own license for images and sometimes also have Creative Commons licensed images. When giving credit, look to see if there is a CC license and if not, you can write down Pexels License, or Unsplash License or Pixabay License depending on the site where you downloaded the image.

    Public Domain

    When an image is in the public domain it means it is free from all copyright restrictions and you are able to use it however you want. Still, it’s best to credit the creator if the person is known. This is similar to the CC0 license.

    Here are some sites with public domain images:

    Unsplash has partnered with some of the above institutions and more to provide pictures through their platform. Read more here, and see some of the collections below:

    Google Images

    Most Google Images can not be used. Google Images can only be used if you do the following:

    • Search for an image: https://images.google.com
    • Click Tools
    • Click the Usage Rights drop down and select one of the “Labeled for …” options
    • Double check that there is a Creative Commons license on the page where the image is located. Some images are listed incorrectly.
    • Use Common Sense: sometimes people will upload an image they did not create to a site like Flickr and add a CC license to the image. This is not legal and does not give you the right to use the image. Look at other images by the creator and see if it seems probable that they created it.