What Makes Midjourney v4 So Much Better?

Midjourney v4 dropped, and on a lot of metrics it blows everything else, including Stable Diffusion, out of the water. In this video we explore the techniques Midjourney uses to make its models better than everything else.

Discord: https://discord.gg/s8rVscu2pM

——– Links ——–

Midjourney V2 vs V3 comparison (very lit): https://github.com/willwulfken/MidJourney-Styles-and-Keywords-Reference/blob/main/Pages/Comparison_Pages/V1_vs_V2_vs_V3.md
David Holz interview: https://www.theregister.com/2022/08/01/david_holz_midjourney/
Other David Holz interview: https://www.theverge.com/2022/8/2/23287173/ai-image-generation-art-midjourney-multiverse-interview-david-holz

——– Music ——–

Music from https://freetousemusic.com

‘Onion’ by LuKremBo: https://www.youtube.com/watch?v=KGQNrzqrGqw
‘Snow’ by LuKremBo: https://www.youtube.com/watch?v=wYiNao04Wg0
‘Sunset’ by LuKremBo: https://www.youtube.com/watch?v=gv7hcXCnjOw
‘Affogato’ by LuKremBo: https://www.youtube.com/watch?v=YTUF1o9Sf3E

Many thanks to LuKremBo

#stablediffusion #aiart #news #art #midjourney #ai #technology #breakingnews

21 Comments

  1. Sure, Midjourney is currently "better" than Stable Diffusion.

    But do you know one thing it can't do? Well, I guess you knew it already, and that's the exact reason why I haven't tried MJ v4 yet.

  2. I think it's important to give users creative freedom to make what they want, even if NSFW. Mainly because, without NSFW content, a lot of things just wouldn't exist. In the future there will be 2 big AI companies: 1 that has control over the image generation and can limit the images to ethical SFW for professional API access, like what OpenAI and Imagen profess. Then there will be another that is completely open, for the degenerates to use.

  3. While I like the V4 quality, I feel we will soon end up with a DallE 3 fiasco where the model is closed and you are free to use the slow API … for money. That's why I thank God there is SD to equalize the market.

  4. So we have Lexica as a source of pre-generated images. If Lexica added a feature to upvote images, it could help a lot to create a dataset that is based on user bias…
    Also, I just had the idea to scrape Lexica for regularization images. It would probably help Dreambooth a lot if those came from images generated with decent prompts rather than just standard prompts.

  5. I had a discussion with some people on Discord about the inner workings of Midjourney a little while ago (this was still about V1, I believe). We all suspected that they were using the CC12M dataset and something similar to v-diffusion-pytorch by Katherine Crowson. In addition to that, someone suggested that they likely added specific keywords to the prompt given by the user. So for example, if a user wrote a prompt like "a white cat with a purple hat", then the Midjourney algo would perhaps add "trending on artstation". So what you mentioned about user feedback, or more specifically 'ranking' feedback, could indeed indicate that the folks at Midjourney are trying to find ideal keywords to add to the prompts.

  6. I was trying to do the same thing for Stable Diffusion, even trying to incorporate prompt corrections that could just be handed over to Emad. There doesn't seem to be enough interest though, and automatic1111 wouldn't reply to it.

  7. Love the content, but isn't a GAN also an iterative process? Adversarial Network… one network generates images, another "judges" them, and the process iterates. Or have I got that entirely wrong?

  8. Hustler Magazine, Inc. v. Falwell has already been decided. Here's my unsolved dilemma; comment if you have a solution.
    The rights to a photograph go to the photographer, not the model. So if I take a picture of female "A", I have the rights to that.
    So now I create a new picture that female "A", now XXX, would find offensive. Does female "A" have a cause of action, or are we now in
    a world where it is cool that anyone can be generated doing anything?
    I've been making the mistake of focusing my attention on what Stable Diffusion can do (we, the viewers of your channel, understand the limits)
    and spending nowhere near enough time thinking about how it will impact society.
    Furthermore, exactly how many days away is the first occurrence of a Stable Diffusion generated picture surfacing
    as political leverage? The public already isn't fact-checking "Jack"; a picture of anything could sway the course of history if
    people believe it is real.
    The general public doesn't have a clue about what is possible now.
    A fake picture created to sway the public is a guaranteed ticking time bomb ~ the only thing up for discussion is where and how it will go off.
    The comment I'm looking for would be: We stop this obvious bomb from going off by…….. (I haven't a clue)

  9. V4 doesn't seem to do likenesses of people very well now compared to the --test and --testp versions or SD. I think this was a deliberate move on the part of the MJ team.

  10. To be honest, what baffles me about V4 is that despite all the amazing new features and aesthetics, HANDS still suck. The same thing happens with Stable Diffusion 1.5, and even Dall-E sometimes generates weird ass hands. This should be as important as fixing the faces and eyes if they really are aiming to automate art, and the saddest part is that there are guys "unintentionally" fixing hands by just training a model with Dreambooth. I mean, there is an adult-oriented SD model that is 10 times better at making hands than SD 1.5 and MJ V4. This is sad.
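
The idea floated in comment 5 (append style keywords to the user's prompt, and use ranking feedback to find which keywords work best) can be sketched roughly as follows. This is only an illustration of the commenter's guess, not how Midjourney actually works: the function names, the 1-5 rating scale, and the sample feedback are all made up for the example.

```python
from collections import defaultdict


def augment_prompt(user_prompt, keywords):
    """Append style keywords to a prompt, skipping any it already contains."""
    extras = [k for k in keywords if k.lower() not in user_prompt.lower()]
    return ", ".join([user_prompt.strip()] + extras)


def rank_keywords(feedback):
    """Sort keywords best-first by their average user rating.

    `feedback` is a list of (keyword, rating) pairs. The ratings are
    hypothetical scores (e.g. 1-5) given to images made with that keyword.
    """
    totals = defaultdict(lambda: [0.0, 0])  # keyword -> [sum, count]
    for keyword, rating in feedback:
        totals[keyword][0] += rating
        totals[keyword][1] += 1
    means = {k: s / n for k, (s, n) in totals.items()}
    return sorted(means, key=means.get, reverse=True)


# Made-up feedback data, purely for illustration.
feedback = [
    ("trending on artstation", 5),
    ("trending on artstation", 4),
    ("4k", 3),
    ("oil painting", 4),
    ("4k", 2),
]

best = rank_keywords(feedback)[:1]  # keep only the top-rated keyword
print(augment_prompt("a white cat with a purple hat", best))
# prints: a white cat with a purple hat, trending on artstation
```

A real system would presumably need far more feedback per keyword and some way to account for keywords interacting with each other; averaging ratings per keyword is just the simplest thing that shows the loop.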


© 2025 AI Art Video Tutorials