Mozilla's petition to get an answer from Microsoft: is it using your data to train its AI?

Mr. Forager@lemmy.world to Technology@lemmy.world – 658 points –
Ask Microsoft if They Plan to Use Our Personal Data to Train AI
foundation.mozilla.org

Microsoft's new Terms of Service agreement is rather questionable. In short: it does not clarify whether Microsoft will use your data to train its AI.

So Mozilla is calling on people to sign its petition asking Microsoft to give a proper answer! You can sign it here -> https://foundation.mozilla.org/en/campaigns/microsoft-ai/

Mozilla's context:

Ask Microsoft: Are you using our personal data to train AI? We had four lawyers, three privacy experts, and two campaigners look at Microsoft's new Service Agreement, and none of our experts could tell if Microsoft plans on using your personal data – including audio, video, chat, and attachments from 130 products, including Office, Skype, Teams, and Xbox – to train its AI models.

If nine experts in privacy can't understand what Microsoft does with your data, what chance does the average person have? That's why we're asking Microsoft to say if they're going to use our personal data to train its AI.

On my work PC it's painfully obvious that MS tracks every word you type into Teams and Outlook based on the clickbait shit they plaster all over the MSN homepage. It's always customized to include topics that were discussed in my work messages.

Nowhere in Office 365 land do you see a notification that they are analyzing everything you do, but it remains obvious that they are.

This leads to the reasonable conclusion that they will abuse your data for any avenue of profit.

That’s wild. Are you serious? Can you point to any proof or articles about that direct reflection of the snooping? I assume your employer had to agree to their information being used for advertising/etc.

No, I haven't researched it at all. I have simply observed it in action in the crap they push through on a browser without an adblocker. Lots of very specific things related to the contents of my work discussions.

That is built inherently into the Windows OS. Open your resource monitor and check network activity. Put those IP addresses into https://www.ip-lookup.org/location

And then question why all that information is being sent out. Drivers, DRM for software, and plenty of other things have self-reporting automation built into them these days.
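
For anyone who wants to try this, here is a minimal Python sketch (an illustration only, not something the commenter posted) that filters a pasted list of remote addresses down to the public ones worth putting into a lookup site. The function name `public_remote_addresses` and the sample addresses are hypothetical. It uses only the standard `ipaddress` module and does no network lookups itself.

```python
import ipaddress


def public_remote_addresses(addresses):
    """Return the unique, publicly routable IPs from an iterable of strings.

    Private, loopback, and link-local addresses are dropped because they
    never leave your own machine or local network.
    """
    public = []
    seen = set()
    for raw in addresses:
        try:
            ip = ipaddress.ip_address(raw.strip())
        except ValueError:
            continue  # skip anything that is not a valid IP address
        if ip.is_global and ip not in seen:
            seen.add(ip)
            public.append(str(ip))
    return public


if __name__ == "__main__":
    # Hypothetical addresses copied by hand from a network activity list.
    copied = ["192.168.1.10", "127.0.0.1", "8.8.8.8", "8.8.8.8", "not-an-ip", "1.1.1.1"]
    for address in public_remote_addresses(copied):
        print(address)
```

Whatever it prints is the short list to look up by hand. Who owns an address still tells you nothing about what was actually sent to it.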

Is Mozilla the only company fighting for privacy?

Every company is fighting for profits. Be a fan of products, not brands.

Mozilla is a non-profit organization

Our mission is to ensure the Internet is a global public resource, open and accessible to all.

They sell some services to fund their other initiatives

Didn't Mozilla get most of its funding from Google for promoting its search engine? Or has that changed?

If the document doesn't explicitly say that they don't, they do. And even if it did, odds are they (or one of their 'partners') do anyway.

Is there any guide for a Windows noob who wants to switch to Linux? I mostly use software that manages my video and audio collection. I don’t know where to start.

Start using free software now, while you are still on Windows. Whenever you want to do something new, do a search for free software you can do it with. Then when you do finally switch, all the software you've been using is already right there.

If you want to get immersed in the Linux world and get a broad understanding, then I recommend watching videos on YouTube by DistroTube. Adjacent, somewhat more advanced channels are Luke Smith and Brodie Robertson.

If you just want to use Linux and be done with that topic, you can use Linux Mint. What you have to know is that you get all software from the software center, not from websites. The rest should be very familiar.

I recommend a virtual machine on your Windows PC as a host.

Start simple, e.g. do all your web browsing in the Linux VM. Don't try to transition entirely to Linux in one go, that's too much. Once you're comfortable in the web browser, add one more piece of software.

Eventually get to the point where you're doing everything in the VM for a month or so, and then boot into it directly. Or perhaps buy a second PC and a KVM for your keyboard/mouse/monitor. Because you might find there's one thing (e.g. games) that works better on Windows.

What's a KVM?

A keyboard, video, mouse switch (for using one set of keyboard, mouse, and monitor with multiple computers). Think of it like a channel changer.

OK, that's cool. I'm tracking now, just wasn't familiar with the acronym. I could definitely see the benefit of being able to hot-swap between environments like that. I'll have to remember that if I get to a point where I can dive into it fully.

You should first dual-boot. It means you will keep your Windows partition and when you turn on your computer, you can choose Windows or Linux to boot up.

To choose a distro, there are plenty of YouTube reviews. I'd recommend Ubuntu, Pop!_OS or Linux Mint for a beginner. Dual-booting is easy on these distros: you just have to select "install alongside Windows" and then choose how big you want the Linux partition to be.

For putting it on a USB drive, download the ISO of your chosen distro and use balenaEtcher to flash it to the drive (this will erase everything on the drive, so back up your data first). To boot from the USB, reboot while holding Escape and see if that brings up a boot device picker. If it doesn't, try other keys along the top of your keyboard, or, on Windows 8 and later, click Restart while holding down Shift, wait for the blue menu to load, go into Select boot device (or whatever it's called), and select the USB.

Before installing, you should check that things like audio work on Linux (you can test them because you are on a live system booted from your USB). If something doesn't work, check whether you can find a fix online, but everything should work fine.

For software alternatives (if your apps aren't on Linux), I recommend alternativeto.net, and then learning the new apps. When you feel comfortable, you can move all your files to Linux and completely delete Windows (by the way, you should be able to see your Windows partition from a file manager app).

Thanks everyone for being so friendly and wanting to help.

Why ask when you know the answer is yes?

If the product is free, you are the product. Even when it's not free, you're still the product because data is too valuable.

Although that's a good mentality, it's not always correct. Also, this isn't just about "asking"; it's about pushing big corps/tech to be more transparent about their policies.

And if they say "yes", if they are blatant and transparent about their business model, that will somehow make it better? This idea of "putting sunshine on a problem" never actually solves anything. The problem company just comes back with "Yeah? So? What the fuck are you going to do about it?"

Well, the more negative feedback they get, the more they will rethink it. Just like what happened to Google's proposed "Web Integrity" API recently. It received a huge backlash and in response they dropped the idea, for now...

But if you're asking whether "blatant and transparent" policies are better than ones that are not, then the answer is a big fat YES, of course they are. I personally have had enough of Google and Microsoft, so I'm staying away from their services as much as I possibly can.

Shining light on a problem is a good step to make people realize there is a problem in the first place.

What the fuck are you going to do about it?

Start a meme campaign targeted at countries with privacy legislation, aimed at making their future governments ask for higher bribes, I mean more lobbying, before signing away taxpayer money to Microsoft contracts...

I mean, ideally have Microsoft rethink its approach, like Meta is rethinking its own with Instagram, but let's start with something simple.

Yeah we should just never do anything if it doesn't instantly fix the problem.

I'm just going to start posting in Esperanto. Even AI won't be interested in learning Esperanto.

Too late, it already has learned it:

Default (GPT-3.5)

User: Translate the following text into Esperanto: "I'm just going to start posting in Esperanto. Even AI won't be interested in learning Esperanto."

ChatGPT: "Mi ĵus komencos afiŝi en Esperanto. Eĉ la intelekta artifiko ne estos interesita lerni Esperanton."

“Open”AI stole the open web, monetized it, and made billions, and there have been no solid legal consequences. So why wouldn't Microsoft and other companies do the same? I mean, Google is doing it and built an empire on it.

Good question.

For me, there is a difference - I feel differently about a company using stuff I posted on the open web vs messages I've sent on Teams, Skype, etc., which feel like they should be more private. There is probably also a legal/privacy angle for this difference too, for this same reason (?)

Was OpenAI ever open? Or is it just a naming scheme?

I've been meaning to look that up for a while

OpenAI was created as an open source (which is why I named it “Open” AI), non-profit company to serve as a counterweight to Google, but now it has become a closed source, maximum-profit company effectively controlled by Microsoft.

Interesting

What if I publish all my personal data myself? What if I also publish it under a license forcing anything that uses it to be open source?

I mean, it is. They keep a list of all your conversations and they are extremely vague about giving a direct reply. Hopefully this does something because, as the US Congress itself has admitted, they cannot afford to let the same thing happen with advanced AI that they let happen with social networks. Transparency needs to be a thing, and not the fake "oh yeah, I'm all about transparency" kind where they then go out of their way to hide shit under the carpet, or gaslight with bullshit when they can't.

Something something spez the hurensohn also jumping in on the same bandwagon some time back?

I don't use anything made by Microsoft, so I don't think it's using my data to train AI.

It's probably ok for you to not use this form then.

I mean… Google already does this… We know Microsoft already does this. This feels like an attempt by Mozilla to garner attention from both the press and users to promote Mozilla accounts.

People like you are the worst on the internet

I love that this upset you so much, you commented about it. 👏👏👏

FWIW, this is calling out FUD.

Hold on, let me get this straight. You say that Google and Microsoft are already using your personal data for the benefit of their company profits, without permission - but that Mozilla are the bad guys for calling them out on it, and offering alternative products that don't exploit users. Is that right?

Why is everyone upset about personal data used to train AI?

  • They harvest it without your consent
  • They don't tell you what they harvest
  • They don't tell you what they use it for
  • It's your personal data

Yes, you could argue that by signing up for their services you give them perpetual permission to do what they want with your data, which is what usually happens, but the real issue is that this is considered acceptable to begin with.

Why should a company get to use my work and data for free to train their AI, from which they'll make a ton of money, without compensating me? At a minimum they should be informing me so I can make that choice with full knowledge.

This isn't a university or educational research either; this is one of the largest companies in the world, with billions of dollars in annual revenue. And to top it off, I already have to pay them for their operating system and annually for their office suite. So not only am I paying them for their product, they'll then steal my data to train an AI and try to sell that to us too?

That's not even taking into account any concerns with "AI might replace me at my job" that a number of folks have.

Not to mention the fact that if they include Office products in this, it's not just personal information.

A lot of IP gets produced in there, even if it's not purchased or created within an enterprise license. So if they train on that they will be basically stealing corporate information that they definitely have no rights to.

In theory they shouldn't. Society has given in too much regarding what data can be used, and here we are.

I think no personal information should be accessible to third parties or tech companies. Your personal information is just that... personal. Unfortunately, it is all about bottom-line profit.

They compensate you in the form of providing products like Bing for free. Same way that Facebook pays their bills by running ads.

Lack of informed consent is reason enough.

Here's a deal. Give me unlimited lifetime access to your AI and you can use my public online data. Should be fair enough, right?

Beyond what everyone else has said, it has already been shown that LLMs have a chance of regurgitating training data, which means that someone's personal data could get returned in a Bing Chat query.