A bride to be discovers a reality bending mistake in Apple's computational photography

stopthatgirl7@kbin.social to Technology@lemmy.world – 258 points –
A bride to be discovers a reality bending mistake in Apple's computational photography
appleinsider.com

A U.K. woman was photographed standing in a mirror where her reflections didn't match, but not because of a glitch in the Matrix. Instead, it's a simple iPhone computational photography mistake.

98

This story may be amusing, but it's actually a serious issue if Apple is doing this and people are not aware of it because cellphone imagery is used in things like court cases. Relative positions of people in a scene really fucking matter in those kinds of situations. Someone's photo of a crime could be dismissed or discredited using this exact news story as an example -- or worse, someone could be wrongly convicted because the composite produced a misleading representation of the scene.

I see your point, though I wouldn't put it that far. It's an edge case that has to happen in a very short duration.
Similar effects can be acheived with traditional cameras with rolling shutter.
If you're only concerned of relative positions of different people during a time frame, I don't think you need to be that worried. Being aware of it is enough.

I don't think that's what's happening. I think Apple is "filming" over the course of the seconds you have the camera open, and uses the press of the shutter button to select a specific shit from the hundreds of frames that have been taken as video. Then, some algorithm appears to be assembling different portions of those shots into one "best" shot.

It's not just a mechanical shutter effect.

I'm aware of the differences. I'm just pointing out that similar phenomenon and discussions have been made since rolling shutter artifacts have been a thing. It still only takes milliseconds for an iPhone to finish taking it's plethora of photos to composite. For the majority of forensic use cases, it's a non issue imo. People don't move that quick to change relative positions substantially irl.

Did you look at the example in the article? It's clearly not milliseconds. It's several whole seconds.

You don't need a few whole seconds to put an arm down.

Edit: I should rephrase. I don't think computational photography algorithms would risk compositing photos that are whole seconds apart. In well lit environments, one photo only needs 1/100 seconds or less to expose properly. Using photos that are temporally too far apart risk objects moving too much in the frame, and thus fail to composite.

There's three different arm positions in a single picture. That doesn't happen in the blink of an eye.

The camera is taking many frames over a relatively long time to do this.

This is nothing at all like rolling shutter, and it's very obvious from looking at the example in the article.

Those arm positions occur over the course of a fluid motion in a single second. How long does it take for you to drop your hands to your side or raise them to clasped from the side? It doesn’t take me more than about half a second as a deliberate movement.

It takes you several seconds to move your arm? I hope you don’t do manual work.

Also did you use the iOS camera app before? You can see how long it takes for the iPhone to take multiple shots for the always-on hdr feature, and it isn’t several seconds.

There's three different arm positions in a single picture. That doesn't happen in the blink of an eye.

It's a lot faster than you might be expecting. I found it helps to visualize it in person. Go to a mirror and start with your hands together like in the right side mirror. Now let your arms down naturally, to the position in the left side mirror. If you don't move your arms at the same exact time, one elbow will still be parallel to the floor while the other elbow has extended already, just like in the middle position.

Thus, we can tell that the camera compiled the image from right to left.

I can also see the three arm positions being a single motion, just in three different time frames. If it really takes seconds to complete a composite, then it should also be very easy to reproduce, and not something so rare it makes it into the news. If I still can't convince you, I guess we agree to disagree then.

then it should also be very easy to reproduce, and not something so rare it makes it into the news.

And it is, according to the article. Just in case you haven't read.

It has made headlines not because it's rare, but because it's outrageous. Just in case you haven't noticed.

Please, feel free to reproduce one yourself then. And no, using the panorama trick doesn't count, which I think the "silly photos" in the article may be actually referencing instead of this.

And is it really "outrageous"? At most I think this is amusing. Nowhere in the article gave me the impression that this is something that people need to be extremely angry about, Mr. Just in case.

It should be. All computational photography has zero business being used in court

All digital photography is computational. I think the word you're looking for is composite, not computational.

Unless the dude is saying only film should be admissible, which doesn't sound all that bad.

Film is also subject to manipulation in the development stage, even if you avoid compositing e.g. dodging and burning. Photographic honesty is an open and active philosophic debate that has been going on since its inception. It's not like you can really draw a line in the sand and blanketly say admissible or not. Although I'm sure established guidelines would help. Ultimately, it's an argument about the validity of evidence that needs to be made on a case by case basis. The manipulations involved need to be fully identified and accounted for in those discussions.

With all the image manipulation and generation tools available to even amateurs, I'm not sure how any photography is admissible as evidence these days.

At some point there's going to have to be a whole bunch of digital signing (and timestamp signatures) going on inside the camera for things to be even considered.

3 more...

I'm still waiting for the first time somebody uses it to zoom in on a car number plate and it helpfully fills it in with some AI bullshit with something else entirely.

We've already seen such a thing with image compression.

https://www.zdnet.com/article/xerox-scanners-alter-numbers-in-scanned-documents/

This was important in the Kyle Rittenhouse case. The zoom resolution was interpolated by software. It wasn't AI per se, but the fact that a jury couldn't be relied upon to understand a black box algorithm and its possible artifacts, the zoomed video was disallowed.

(this in no way implies that I agree with the court.)

The zoom resolution was interpolated by software. It wasn't AI per se

Except it was. All the "AI" junk being hyped and peddled all over the place as a completely new and modern innovation is really just the same old interpolation by software, albeit software which is fueled by bigger databases and with more computing power thrown at it.

It's all just flashier autocorrect.

As far as I know, nothing about AI entered into arguments. No precedents regarding AI could have been set here. Therefore, this case wasn't about AI per se.

I did bring it up as relevant because, as you say, AI is just an over-hyped black box. But that's my opinion, with no case law to cite (ianal). So to say that a court would or should feel that AI and fancy photoediting is the same thing is misleading. I know that wasn't your point, but it was part of mine.

I watched that whole court exchange live, and it helped the defendant's case that the judge was computer illiterate.

As it usually does. But the court's ineptitude should favor the defense. It shouldn't be an arrow in a prosecutor's quiver, at least.

This isn't an issue at all it's a bullshit headline. And it worked.

This is the result of shooting in panorama mode.

In other news, the sky is blue

3 more...

Saved you a click

https://photos5.appleinsider.com/gallery/57517-117166-bride-trio-xl.jpg

Preventing people from perpetuating clickbait "journalism" is so punk rock.

Not even a mistake, this is unavoidable if you move during a panorama. iPhones can't pause time. Cool photo tho

Thank you for saying this. If you have ever shot a panoramic shot you know how steady you need to keep it on the line, otherwise you get a lot of weird things like this, not to mention if your moving while it’s happening.

Uhm, ok?

The way the girl’s post is written, it’s like she found out Apple made camera lenses from orphans’ retinas (“almost made me vomit on the street”). I assumed it was well known that iPhone takes many photos and stitches the pic together (hence the usually great quality). Now the software made a mistake, resulting in a definitely cool/interesting pic, but that’s it.

Also, maybe stop flailing your arms around when you want your pic taken in your wedding dress.

When have panorama photos ever not done weird stuff?

Seriously? She almost vomited because the photos didn’t match? Give me a fucking break!

You think that's absurd? Have you never gotten married? Wedding photos are extremely important and while "she almost vomited" may be hyperbole, I can definitely understand being very pissed off if that was the only version of the photo. Our wedding photographer whitened our teeth in our photos and we requested that they undo that so we look like ourselves. The sentiment was nice, but we didn't want that. I would have been pretty unhappy if they hadn't held onto the originals and were unable to revert our teeth back to their normal shades. Photos of our bridal showers and dress hunting were nearly as important as the wedding photos themselves. I can understand being upset with this undesired result.

Ah yes, I remember noticing it would make like a short video instead of one picture, back when I had an iPhone. I turned that function off because I didn't see the benefits.

That’s not what this is. I also turned that off, it’s called “Live Photo” or something like that. Honestly I find it to be a dumb feature.

What this is, is the iPhone taking a large number of images and stitching them together for better results.

It's not dumb. It let's you select the best moment within a 1-2 second margin after or before you took the picture.

No, these are literally just short videos. You interact with them like photos, you see them as photos, half the time people sending them think they are photos, but when you tap all the way into them they are a short video. They are absolutely not presented as a “choose your exact frame” pre-photo things, they are presented as photos.

Yeah "Live photo" really is just an Apple marketing term. You interact with them in a certain way on iOS and they are presented in a certain way, but anywhere else they're just very short videos.

Wrong. Pretty crazy, it does let you change which frame is the photo. Click edit, then hit the Live Photo icon next to “cancel”

That isnt the point of a Live Photo, that’s just a “feature.” Similar to how YouTube lets you choose a thumbnail for a video, but that’s not really the point of YouTube.

Per Apple support:

With Live Photos, your iPhone records what happens 1.5 seconds before and after you take a picture. Then you can pick a different key photo, add a fun effect, edit your Live Photo, and share with your family and friends.

So it’s actually the first example of what Live Photo is for.

If you didn’t even know about this, don’t feel bad. I’m an Apple fanboy and my daughter just showed me that it allowed you to do this “different key photo” last month. Kids are good for that.

I’m aware that’s it’s possible, but that isn’t part of the onboarding or anything. What I mean is, it’s an addon. It was never part of the original iteration, which was just “look moving Harry Potter photos.”

It’s a gimmick that doesn’t even work cross device, because it’s literally just a short video.

I’m not following where you’re coming from. Are you just going to hate on it?

Because v1 of something didn’t have a feature, it means it should be discounted from discussion?

There’s several neat things about Live Photos, and the arguably nearest thing is being able to change the key frame. Unintended side effect of this feature apple included, maybe. But that’s not relevant.

I’m simply stating that they are short videos. The fact that they can do more is secondary, platform specific, and gimmicky. You send that “love photo” to an android device, a PC, an older iPhone, etc it all falls apart and it’s literally just a short video again.

The ability to change the “cover image” doesn’t mean it’s meant to allow you to pick the perfect frame, that’s just a secondary feature. You’re still sending the entire short video along.

But… no? When I change the key frame (the image) of the video, then send that to someone as a picture, the picture is of that key frame. If I send over iMessage as Live Photo, it’s both a pic and a video.

So it gives the the ability to slightly adjust when the picture was taken to get that perfect shot with no one blinking and things look just right. Basically, every picture is like 50 pictures and I can pick the best. By default, it picks the middle.

Call it what you want, but it’s integrated pic + vid on every picture. Yes. Coupled with simple tools to leverage that for some nice functionality.

If your argument is just “but it’s simple, it’s just a video” then you’re ignoring the UX execution entirely.

90% of people share their photos from the messages app, they aren’t going into their photos and explicitly sharing what they already think is a photo, as a photo.

I get where you’re coming from, but “this buried feature most don’t know about” isn’t what I’m talking about, I’m talking about what the majority of people experience when using them.

For instance, if you take a photo in the messages app, and don’t turn off the Live Photo option, you are presented with what looks like a photo to send. You aren’t given options to change the key frame, you aren’t given much warning that it’s actually a video, for all intents and purposes it’s a photo to most users. Which is why I’ve been sent bits and pieces of conversations people didn’t know they were sharing.

It's a really cool discovery, but I don't know how Apple is suppose to program against it.

What surprises me is how much of a time range each photo has to work with. Enough time for Tessa to put down one arm and then the other. It's basically recording a mini-video and selecting frames from it. I wonder if turning off things like Live Photo (which retroactively starts the video a second or two before you actually press record) would force the Camera app to select from a briefer range of time.

Maybe combining facial recognition with post processing to tell the software that if it thinks it's looking at multiple copies of the same person, it needs to time-sync the sections of frames chosen for the final photo. It wouldn't be foolproof, but it would be better than nothing.

Program against it? It's a camera. Put what's on the light sensor into the file, you're done. They programmed to make this happen, by pretending that multiple images are the same image.

That’s over simplified. There’s only so much you can get on a sensor at the sizes in mobile devices. To compensate there’s A LOT of processing that goes on. Even higher end DSLR cameras are doing post processing.

Even shooting RAW like you’re suggesting involves some amount of post processing for things like lens corrections.

It’s all that post processing that allows us to have things like HDR images for example. It also allows us to compensate for various lighting and motion changes.

Mobile phone cameras are more about the software than the hardware these days

With a DSLR, the person editing the pictures has full control over what post processing is done to the RAW files.

Correct, I was referring to RAW shot on mobile not a proper DLSR. I guess I should have been more clear about that. Sorry!

You might be confounding a RAW photo file and the way it is displayed. A RAW file isn't even actually an image file, it's a container containing the sensor pixel information, metadata, and a pre-generated JPG thumbnail. To actually display an image, the viewer application either has to interpret the sensor data into an image (possible with changes according to its liking) or just display the contained JPG. On mobile phones I think it's most likely that the JPG is generated with pre-applied post-processing and displayed that way. That doesn't mean the RAW file has any post-processing applied to it though.

1 more...

Raw files from cameras have meta data that tells raw converters the info of which color profile and lenses it's taken with, but any camera worth using professionally doesn't have any native corrections on raw files. However, in special cases as with lenses with high distortion, the raw files have a distortion profile on by default.

Correct, I was referring to RAW shot on mobile devices not a proper DSLR. That was my observations based off of using the iPhone raw and android raw formats.

This isn’t my area of expertise so if I’m wrong about that aspect too let me know! 😃

Oh, so you have no idea what you're talking about.

So what was I wrong about? I’m always happy to learn from my mistakes! 😊

Do you have some whitepapers I can reference too?

How about a couple decades of industry experience instead?

Gonna provide more information or is this just a trust me bro situation?

Not sure what I'd have to gain from just lying on the Internet about inconsequential things.

Also not sure I can disclose too many technical details due to NDAs, but I've worked on camera stacks on multiple Android-based devices. Yes, there's tons of layers of firmware and software throughout the camera stack, but it very importantly does not alter consequential elements of images, and concentrates on image quality, not image contents.

While the sensors in smartphones might not be as physically large as those in DSLRs - at least, in general - there's still significant quality in the raw sensor data that does not inherently require the sort of image stitching that Apple is doing.

Oh, so your excuse is you are illiterate?

🙄

Edit: oh, you're the actual illiterate person from another post. Thanks for stalking me.

You think too highly of yourself.

When you comment spam just about every thread you’ll come across people multiple times.

1 more...

What's on the light sensor when? There's no shutter, it can just capture a continuous stream of light indefinitely.

Most people want a rough representation of what's hitting the sensor when they push the button. But they don't actually care about the sensor, they care about what they can see, which doesn't include the blur from the camera wobbling, or the slight blur of the subject moving.
They want the lighting to match how they perceived the scene, even though that isn't what the sensor picked up, because your brain edits what you see before you comprehend the image.

Doing those corrections is a small step to incorporating discontinuities in the capture window for better results.

1 more...

Or maybe just don’t move your arm for literally less than a second while the foto(s) is/are taken.. Moving your arm(s) down happens in less than a second if one just let them fall by gravity. It’s a funny pic nonetheless.

1 more...

I may have missed this in the comments already but it is really important to note here that the article says the photo was taken using panorama mode, which is why the computational photography thing is even an issue. If you have used panorama mode ever you should go in expecting some funkiest, especially if someone in the shot is moving, as the bride apparently was when it was shot.

There’s a note at the end of the article that says it was take using pano. So this is doubly unsurprising. Despite the instagram caption reading it wasn’t.

This person is an actress and comedian. This is not an iPhone error; it's just a manually-edited photo from three separate takes that she pretended came out of the phone as-is. It's a hoax for laughs/attention.