Let’s go for my web review for the week 2024-23.
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Tags: tech, ai, gpt, machine-learning, safety, research
Another cruel reminder that basic reasoning is not to be expected from LLMs. Here is a quote from the conclusion of the paper which makes it clear:
“We think that observations made in our study should serve as strong reminder that current SOTA
LLMs are not capable of sound, consistent reasoning, as shown here by their breakdown on even such a simple task as the presented AIW problem, and enabling such reasoning is still subject of basic research. This should be also a strong warning against overblown claims for such models beyond being basic research artifacts to serve as problem solvers in various real world settings, which are often made by different commercial entities in attempt to position their models as a strong mature product for end-users. […] Observed breakdown of basic reasoning capabilities, coupled with such public claims (which are also based on standardized benchmarks), present an inherent safety problem. Models with insufficient basic reasoning are inherently unsafe, as they will produce wrong decisions in various important scenarios that do require intact reasoning.”
https://arxiv.org/abs/2406.02061
A critique of Cara.app: the ‘No AI’ Instagram and Artstation copycat child. - David Revoy
Tags: tech, social-media, art, criticism
Interesting critique of this new platform… it’s the beginning of the hype cycle but will probably exhibit the same decay phenomenon than other platforms.
https://www.davidrevoy.com/article1032/a-critique-of-caraapp-the-no-ai-instagram-and-artstation-copycat-child
Why are vulnerabilities out of control in 2024? – Open Source Security
Tags: tech, foss, security, data, data-science
The more releases out there the more vulnerabilities are (and could be) discovered. Some actions are necessary to get things under control properly.
https://opensourcesecurity.io/2024/06/03/why-are-vulnerabilities-out-of-control-in-2024/
Engineering for Slow Internet
Tags: tech, networking, reliability
A good reminder of everything which might go wrong when connectivity is bad. Most tools let you down in such a case.
https://brr.fyi/posts/engineering-for-slow-internet
Why do CPUs have multiple cache levels? | The ryg blog
Tags: tech, hardware, cpu
Very nice explanation and metaphors on how CPUs cache levels work.
https://fgiesen.wordpress.com/2016/08/07/why-do-cpus-have-multiple-cache-levels/
BenchExec: A Framework for Reliable Benchmarking and Resource Measurement
Tags: tech, benchmarking, tools
Looks like an interesting benchmarking tool. To keep an eye on.
https://github.com/sosy-lab/benchexec
TIL #099 – order values of dictionary by iterable of keys with operator.itemgetter | mathspp
Tags: tech, programming, python
Definitely a nice Python trick. Fairly elegant, I’ll try to remember it.
https://mathspp.com/blog/til/order-values-of-dictionary-by-iterable-of-keys-with-operator.itemgetter
Message authentication codes for safer distributed transactions
Tags: tech, filesystem, distributed, safety, cryptography
Interesting use of cryptography without a security concern. It’s more about safety and ensuring something wasn’t missed by mistake.
https://mazzo.li/posts/mac-distributed-tx.html
The state of Vulkan apps in 2024
Tags: tech, 3d, vulkan, portability
The difficult path for Vulkan. The data obviously is biased since it includes games and most of them are still targeting Windows and so DirectX. I’d be curious to see something similar excluding games (and so focusing on medical, industrial etc.).
https://www.carette.xyz/posts/state_of_vulkan_2024/
How I learned Vulkan and wrote a small game engine with it
Tags: tech, 3d, game, vulkan
Interesting dive into the experience of writing a small Vulkan engine (almost) from scratch.
https://edw.is/learning-vulkan/
How to Build Engineering Strategy
Tags: tech, management, strategy, vision
Packed with useful information. Clearly some things I’m eager to test in there.
https://blog.practicalengineering.management/how-to-build-engineering-strategy-42e464018177
xkcd: Earth Temperature Timeline
Tags: science, history, data-visualization
So yes, the climate changed before… now slowly scroll until the end to appreciate how brutal it is this time.
https://xkcd.com/1732/
Bye for now!