Microsoft announces Python formulas in Excel... which have to get sent to the cloud

Aatube@kbin.social to Technology@lemmy.world – 518 points –
techcommunity.microsoft.com

Since its inception, Microsoft Excel has changed how people organize, analyze, and visualize their data, providing a basis for decision-making for the flying billionaires heads up in the clouds who don't give a fuck for life offtheline

97

Damn they found a way to make python slower

Emulate that shit with Java!

java isn't slow

I mean, whatever speed java has or doesn't have, what the other person said was emulate, you'll have your os then on top of that the JVM then on top of that your python implementation, then finally the python code. If that's faster than os->python imp..

Just like Python doesn't run from the source code through the interpreter all the time (instead, if I'm not mistaken, the interpreter pass converts the code to a binary runtime form, so interpretation of the source is done only once), so does "modern" Java (I put modern between quotes because it's been like that for almost 20 years) convert the code in VM format to binary assembly code in the local system (the technology is called JIT, for Just-In-Time compiler).

Plus Java IS slow, quite slower than compiled languages at least

java is a compiled language

Eh...Java source code compiles into bytecode which runs in a virtual machine. Compare this to a language like C which compiles to native machine code. Java still gets interpreted.

The bytecode is turned into native code before execution

That's not how it works. If that really was how it worked there'd be no point even having bytecode; you'd just straight up get the native code. Unless you're talking about JIT, but your wording seems to be implying that all the bytecode turns into native code at once.

I was referring to JIT but there are also other options like GraalVM for AOT compilation.

Performance of non-native bytecode such as Web Assembly can reach quite close (80%-ish) to machine code.

Yeah, in my personal experience (with numerical compute-heavy code), normal python code, ran in the normal python interpreter, is much slower than the equivalent normal Java code with the normal Java VM (like 50x). Then C/Fortran is ~2x faster than Java (with gcc + optimization flags).

I think Java is a good middle-ground between coding speed and execution speed. Sadly, it seems to be dying. And JavaFX is shit for trying emulate full-stack web-dev. The fucking ancient Swing is even better.

Scala and Kotlin are OK, but I think they are making the mistake of feature-creep that causes large projects with many people to contain multiple programming paradigms that only some of the team can grok well, instead of a restricted OOP Java codebase that encourages Gang of Four style code. Though, I guess GoF-style code resulted in that crazy complicated "enterprise" Java shit.

Last I checked Java was alive and well in the server-side for things like middleware and backend, especially because the whole development ecosystem is incredibly mature and significantly more stable and well integrated with corporate-category systems than pretty much anything else (good luck managing a single reliable transaction across, say, 2 different databases in 2 different sites and 1 MQ system with Python).

Absolutelly, it's been mostly limping in a half-dead state on the UI ever since day 1 and even Google using it with Android didn't exactly help (because Google's architectural design of the entire Android framework is, well, shit, and has become worse over time).

It also lost it's proeminence in dynamic web page generation at around the early 00s to actual templating languages (such as PHP) with a much lower learning curve and later to Python.

The ecosystem for Java is rock-solid and in widespread use in corporate multi-tier architectures that require reliable operation (were, for example, it's native multi-threading synchronisation support and core libraries make a huge difference) and integration with professional backend systems, but for the rest, not so much (I did both that stuff and Android, and the latter is like the amateur-hour of Java ecosystems in comparison with the former).

At least Android has switched to Kotlin which is rightfully superior.

The problem in Android has always been that the framework design is pretty bad in design and technical architecture terms and its evolution over time has made its glaring flaws more obvious and actually made it even messier, rather than the language (Java is fine as languages go and UI stuff only has to run in user-time, so response times of 100ms are fine and bleeding edge performance is not required).

Further, splitting the user base into two languages, by introducing a new language that is not used anywhere else (hence you're not going to find Kotlin programmers from outside Android development whilst you will find plenty of Java programmers) is one of the stupidest technical architecture decisions I've seen, and I've been in the industry for over 2 decades.

Last but not least, the gains from the small programming-time efficiency advantages of Kotlin over Java are a drop in the ocean next to the losses due to the Android Framework itself being badly designed (something as simple as not having functions in different core classes that do the same thing named the same).

Even for programmers going for Kotlin is a less than wise career move: as an Android-only language those who specialize in it are locking themselves into programming for Android only and have fewer career options - hands up anybody who expects to still be programming Android in 10 years time! The great thing of generic languages is that there are a lot of lateral career moves you can make without the high likelihood of failure that comes from hiring managers naturaly prefering people with several years of experience in the programming language used in their projects over people who say "I've mainly done Kotlin but I can learn that easilly".

What many years of experience in the industry tells me is that you don't want your career to hang on the ficklness of a vendor, any vendor, especially the likes of Google who will drop massivelly hyped systems with tons of 3rd party investment whenever they feel like: just ask everybody who invested in developing for Stadia.

I code Kotlin for a hobby and have never heard of Android frameworks. While the largest use case is still Android it IS a very good language, and there are a LOT of things online about Kotlin being used outside of Android. Kotlin is a generic language; you don't need Android stuff at all to use it.

6 more...
6 more...
6 more...
6 more...
6 more...
6 more...
6 more...
14 more...
14 more...
14 more...

I used to make high performance distributed computing server-systems for Investment banks.

Since the advent of Just In Time compiling, Java isn't slow if properly used.

It can however be stupidly slow if you don't know what you're doing (so can something like Assembly: if you're using a simple algorithm with a O(n) = n^2 execution time instead of something with O(n) = n*log(n) time, it's going to be slow for anything but a quantum computer, which is why, for example, most libraries with sorting algorithms use something more complex than the silly simple method of examining every value against every other value).

Just because you can make slow assembler doesn't make Java fast.

Java is clunky and not fast compared to C++ which is clunky and fast, Python which isn't clunky but slow bla bla bla bla...

That's oversimplifying things.

For things like algorithms Java can be as fast as C++ because ultimatelly it all ends up as the same assembly code: the differences in performance between one and the other are within the same range as the differences in performance between different C++ compilers or even optimization flags chosen and ultimatelly boil down to how good the implementation of the Java Virtual Machine is for a specific architecture (things like whether the generated assembly takes in account the way the microcontroller handles conditional jumps and cache misses).

Were Java is slower than C++ is in things like memory management: even the best garbage collection will never beat the kind of carefully handcrafter managing of memory that you can do in C++, though the upside of the different memory management architecture in Java is that your programs don't just start behaving in weird ways because you incorrectly accessed a variable in stack using a pointer and overwrote other stuff or even the function return address.

Then again if performance is your greatest consideration you would go for C, not C++ (note that I didn't say Assembly, as good C compilers are actually better at optimizing than most Assembly programmers).

Well that was a lot of misconceptions.

So compiler will make java as fast as C++ but not C++ as fast as C?

Also, when speed matters it's never great to have huge Java classes, it just won't be optimised away (anough) and you'll have a memory / bus bottleneck.

Also, if you really want speed you go parallel, IDK if java is up for the challenge lol (can you configure stackspace and so?) or to beat them all, you run it on the GPU (or several GPU lol :-)

I think I didn't explain myself correctly.

The Just In Time Compiler in a Java Virtual Machine which does a final compilation step at runtime from JVM assembly to native code will make the typical code in algorithms run as fast as in C++ because ultimatelly they both end up as the same assembly. I've actually measure this by the way, though it was years ago.

However things like memory architectures are different between those languages and the one in C++ can easilly be made faster than in Java, though the downside of that memory architecture is nastier bugs.

Further, and this time about general performance, if you're worried about performance above all, then in terms of general performance, C is generally faster than C++: for example virtual functions in C++ objects have calling overheads which function calls when there is no inheritance do not have so unless you don't use inheritance at all, some function calls in C++ will be slower.

As for parallelization, I've actually worked both in massive parallel Java with distributed computing and GPU computing and they're completelly different kinds of parallelization with different capabilities and optimal use cases: good luck making a CUDA application optimized for serving paralled requests from millions of clients sourcing data from multiple sources and good luck making an LLM runtime with distributed computing in Java were parts of the pre-trained neural network are in different machines - GPU computing can't arbitrally decide it needs some data and fetch it whilst the overhead of synching two parts of a Java system running in different machines is way (millions of times?) larger than the overhead of synching read-then-write-access to the same position in a RWStructuredBuffer from 2 different processing units.

Comparing these two kind of parallelism is not an apple and oranges comparison, it'sm more like an apples and horseshoes comparison.

14 more...

That made a python interface wrapper arround VB Script.

14 more...

Wow, Microsoft is now copying LibreOffice. Who could have guesses?

The difference is Microsoft will feed your scripts into training its AI.

Microsoft is stupid, but not too stupid to realize that Excel users are generally tech-illiterate and most of them will produce garbage code.

Gpt/Bing attempts at python code for excel lmao, full circle, ai training on itself

11 more...

I don't get it. Why I need cloud to run Python scripts which can be done locally? Installing Python isn't hard and MS can bundle it as a library with Office either.

I've been asking this for every other cloud service - either companies are jumping on the 10-year-old bandwagon or want to collect data for AI training purposes in a way you cannot just disable in the settings. And of course you cannot self-host your own server.

This sounds like a security check. Our liability and ransomware insurance both require scripts to be turned off for excel and word.

I believe security threats can be mitigated locally without resorts to cloud.

Actually, one can argue using cloud is less secure because there is a risk of sensitive data leaked out of cooperate network.

You could argue that, but I wouldn't recommend it.

Microsoft has a massive amount of resources to throw at securing their environment, whereas most businesses simply don't have the ability to field a dedicated security team. The solution many reach is to offload risk to your software vendor, in this case Microsoft. Then, if there is data lost, it's Microsoft's fault, and it's their problem to fix, too. It's not ideal, but it's the world we're living in.

Cloud doesn’t have access to local drives…but in this day and age, python could be containerized or sandboxed. Sounds messy though.

But Micro$oft never implement any permission system or checks for Visual Basic in Office so any macro can use anything the user has access to. If the scripting language could only access its document’s content in an undoable way without explicit permissions such as to use the filesystem and Internet or modify the Normal template (as opposed to the current system, which does not differentiate between useful scripts and malware and can easily be bypassed by social engineering), the risk would largely be mitigated but Micro$oft does not care.

When you save your doc to one drive then you can access it from the web version of office. That's the reason they've been encouraging developers to write add-ins that run from the cloud. I'm guessing that this is for similar reasons

You can also access docs by uploading it to the web version of office. Correct me if I'm wrong but last I remembered add-ins don't choose between desktop and web.

Not everyone has an opportunity to work with Python in their work environment. I'm on the "business" side of the company, capable of doing most of programming stuff myself (Python, C#, SQL, etc.), whereas only "IT" people can work with the proper compileable code. And I'm left out working with VBA macros, or ask IT to write a script for me, which will take 1 year to develop. This change now will improve my local productivity for sure.

This issue isn't about authoring the script, is about why it needs to execute on the cloud rather locally.

Fair point, maybe I replied to a wrong comment. Nonetheless, I've seen comments saying "just use native Python". Not everyone can do that.

This happened in my old place - also on the business side. Asked for python, got it, then had it immediately removed because of security risks.

I told the head that I could still access tables and shit via excel if I wanted so what does it matter? He didn't realise this, and asked that I told no-one else it could be done. FFS.

Does anyone not think Microsoft is going to use all their cloud data for training language models?

"We promise not to" doesn't seem realistic to me. Proving they used it is impossible.

“We promise not to”

Where did you quote this from? I think they won't care, just like GitHub Copilot, or it will be an opt-out feature.

Surprising no one. You can't even autosave files in Office software anymore unless you use OneDrive.

unfortunately the local storage technology just isn’t there yet. we have to rely on the magic of the cloud to handle complex things like auto saving files and running python interpreters

😂😂😂

What does Lemmy think about emoji

In general I think emojis add nonverbal context clues to conversations and in my experience are a valuable tool for conveying information quickly. I'm seeing them used in all but the most formal of cases in a work environment and I don't think that particular train is going to be stopping anytime soon.

That gives me a great idea to design emojis for corporate use. The ones that we have are unsuitable but like you said.. They add a layer of communication

Yeh that pisses me off. When I looked that up, I saw that on the Microsoft help forums their response was 'well, you never really had that feature locally anyway'.

This is inside-out programming. I want my code to read data files, not my data files to contain code.

The first example is how to take cells in the sheet and make a data frame in an Excel equation. That's easy, pandas.read_excel(): no clound needed, no need to hunt through cells of a sheet to find your code.

Holy shit Excel is still like it was 20 years ago. And now, when they want to add such a useful feature, it comes with that bag of crap with it? Fucking hell. I know why I switched all non-trivial stuff to python.

Defining reusable functions? Diagrams with parametric ranges? (Only the title can be a cell reference, nothing else) Zooming in diagrams? More than 1 x axis? More than 2 y axis? ...

Even a basic thing like XY scatter plots is absolutely terrible. I had made dozens of them before but in 365, when I selected the data and clicked "Scatter Plot", it refused to assign the left column to X and the right to Y. I fought for 10 minutes with "Chart Series" and then gave up to look up the solution. Also very few trendline functions are available, the default axes look like crap and min/max is broken for logarithmic charts.

Not to mention that one third of all options is in a keyboard-unfriendly, laggy "responsive" UI while the rest is in windows that barely changed since 1995, and are completely missing in the web version. Localization has only become sloppier and you want centimeters rather than inches in Office Online? Fuck you.

I agree with the rest but I think the ribbon concept itself is nice and they actually updated the UI for Windows 11

I like Ribbon too but the Home tab keeps getting used to give spotlight to rather unimportant new features. Also, the Windows 11 redesign is horrible, the buttons are spaced out too far. At home, I use my old legal copy of Office 2007 via Wine of I need to correctly open Office files while getting used to the messy icons and dropdowns of LibreOffice.

LibreOffice also has a ribbon. Just go to View → User Interface and select "Tabbed Bar" or something like that. I like contextual groups (which needs experimental features enabled, unfortunately) better since it's like the MS office "group descriptions" but it isn't fully suited for Impress yet so groupedbar (non-compact version also experimental) is a nice middle ground.

Oh those numbers? That is a date. I will delete the original numbers and replace it with something else. Good luck.

Excel is one of those examples of a software that is only used everywhere because it used to be the best and it is overall not bad. So "everyone" already knows it and the incentive to move to something else is fairly small, because it would only be so much better. But still, why are they not developing it further? Completely crazy.

I feel like excel is still used everywhere because it's the kind of Jack of all trades.

There is a better alternative for all the applications excel is used for but no alternative that can do all at once.

LibreOffice? I mean some features are missing but I don't see why you'd need them

Maybe it's because I'm so accustomed to Excel, but Libreoffice Calc is so painful to use. Not the app that's going to win people over.

Not developing it further? They are bound to their shortsightedly developed xlsx format and they need to make sure legacy stuff doesn't break... However, the Win11-style redesign is a regression, and the Czech spell checker has been flagging a completely normal verb class as ancient for years, and the heavily-touted Web Sources cannot even parse JSON. They also keep adding shit like Online Images (with Bing), automatic image recognition that I imagine collects shitloads of AI training data and of course support links that only open in Edge even though it works no better than your default browser. I wonder where the prosecutors who brought M$ to court for bundling IE with W95 are today.

Unfortunately that case only ended in an order for Microsoft to share its API for at least 5 years and for the DOJ to have full access to MS stuff including source code during this time

So this is basically a native version of xlwings that requires exposing your excel data?

Does xlwings require any specific Python installation? Can I use it with my WinPython 3.10 installation on Windows 10?

As long as it can run Python, yes. It functions as a library just like VBA.