This is a very good topic, and I hope a lot of experienced AI jockeys can reply.
I been using WebUI Forge, SwarmUI and ComfyUI, all tools within the Stable Diffusion architecture and one thing I've found is that the software just doesn't understand scales or sizes very well, especially when the subject in question doesn't exist at the prompted size in reality. For example, in your prompt you specify 1/100 scale. If the woman is originally 5 ft 5 inches tall, then she would be 1/100th of 65 inches or 0.65 inches tall (1.66cm) which she certainly isn't in the image above.
I've had the same issues where I prompt "3 inches tall", and get a whole range of sizes with little consistency. This is expecially true when I have a normal sized person in the same image. I can get close to the size scale I want if its just the subject woman with maybe a giant hand or some kind of giant household item like a cup or a spoon. But again, no consistency.
For example, this is the absolute closest I've ever gotten to my prompt of "3 inches tall". As you can see, the only other thing in the image is a giant hand, no background, nothing else. In fact, its not even an entire hand.
But pull back a little to get the whole hand, and all of a suddent the subject grows in size.
With even a partial face of a normal sized person in frame the subject grows even larger.
And with the entire face and upper torso of a normal sized person in the background the subject grows even more, despite the prompt's "3 inches tall" specification remaining unchanged.
Like I said, the software doesn't seem to know exactly what we're trying to produce and confuses scales and sizes, especially when normal sized people and objects are also included in the prompt. And prompting for "photorealistic" or "hyper detailed" actually seems to exacerbate the problem. I'd really like to hear from some of the AI experts out there and receive the benefit of their experience.