Show HN: MacMind – A transformer neural network in HyperCard on a 1989 Macintosh

edwin · 2026-04-16T17:23:38 1776360218

There’s something quietly impressive about getting modern AI ideas to run on old hardware (like OP's project or running LLM inference on Windows 3.1 machines). It’s easy to think all the progress is just bigger GPUs and more compute, but moments like that remind you how much of it is just more clever math and algorithms squeezing signal out of limited resources. Feels closer to the spirit of early computing than the current “throw hardware at it” narrative.

wdbm · 2026-04-16T18:38:37 1776364717

There is an absolutely beautiful rendering of the Mona Lisa encoded at some point in the digits of pi. If you know the position, it's really easy to plot the image.

But first you have to find that position.

zoky · 2026-04-16T23:01:11 1776380471

This is both simultaneously false, and true but largely meaningless. If you mean the Mona Lisa is somehow directly encoded somewhere in pi, then of course it’s not. It’s just a number.

If you mean that when you feed the numbers starting with some offset of pi into a specific algorithm you will get a rendering of the Mona Lisa, then yes, but so what? Allow me to introduce you to the PiMona algorithm. I won’t bother you with the implementation details, but it takes exactly one integer parameter. If it’s 3, it produces a beautiful rendering of the Mona Lisa. Anything else and it generates random garbage. Turns out, it’s really easy to find where the Mona Lisa is encoded in pi! It’s right there at the start.

But let’s say you meant that the digits of pi at some offset, when encoded properly and fed into any algorithm that is theoretically capable of generating the Mona Lisa will cause that algorithm to do so, then sure. But that’s also true of random noise, and says more about the algorithm and the nature of random numbers than about the Mona Lisa somehow being encoded into the fabric of the universe (which I’m sure isn’t what you meant, but I’m just saying there’s nothing really special about pi in that regard, except that as far as we know, it continues infinitely).

hammer32 · 2026-04-16T19:14:36 1776366876

Exactly. Working in a constrained environment invites innovation.

Unbeliever69 · 2026-04-16T19:25:51 1776367551

Now do this on a Casio Watch next :)

hyperhello · 2026-04-16T15:07:21 1776352041

Hello, if there are no XCMDs it should work adequately in HyperCard Simulator. I am only on my phone but I took a minute to import it.

https://hcsimulator.com/imports/MacMind---Trained-69E0132C

hammer32 · 2026-04-16T16:37:23 1776357443

I had no idea your simulator existed. No XCMDs, correct; everything is pure HyperTalk. I just ran a few training steps and they complete in a second or two. Thank you for importing it!

hyperhello · 2026-04-16T16:43:14 1776357794

I gotta ask. Your scripts have comments like -- handlers_math.hypertalk.txt at the top. Are you using some kind of build process for a stack?

hammer32 · 2026-04-16T17:11:54 1776359514

More of a copy-paste process. The scripts are written as .txt files in Nova on my Mac Studio, then pasted one at a time into HyperCard's script editor on the classic Mac. The files are kept separate because SimpleText has a 32 KB text limit.

hyperhello · 2026-04-16T17:23:56 1776360236

As an alternative, you might consider letting Hypercard itself open the text files and 'set the script of' as needed.

hammer32 · 2026-04-16T18:18:15 1776363495

Yup, that would have been easier. It's been decades since I've done anything with HyperCard. I had to re-take the built-in intro course again :)

jasomill · 2026-04-17T01:35:50 1776389750

Would that overcome the size limit?

Does HyperCard implement its on text handling for the HyperTalk editor that doesn't rely on the TextEdit toolbox service (which IIRC is the source of SimpleText's 32 kB limit)?

hyperhello · 2026-04-17T01:52:49 1776390769

Fields appeared to use TE and I suppose the script editor was pretty much limited to 32 kB of text for that reason, although you could have any size of text in a variable.

jasomill · 2026-04-17T02:06:00 1776391560

Curiousity got the better of me, and I just tested it in Infinite Mac.

The HyperTalk editor is indeed limited to 32 kB.

It's certainly possible that this limit only applies to editing scripts, as it's unlikely TextEdit was used in the process of interpreting them, but I don't have time tonight to investigate.

Later versions of HyperCard supported OSA scripts as well, now I'm also curious what the size limit is for (presumably) compiled AppleScripts stored in HyperCard stacks.

watersb · 2026-04-16T22:19:15 1776377955

This is great!

I first studied back-propagation in 1988, at the same time I fell in love with HyperCard programming. This project helps me recall this elegant weapon for a more civilized age.

hammer32 · 2026-04-17T11:25:31 1776425131

Building this definitely felt like constructing a lightsaber from spare parts: slow, deliberate, but it works and you understand every piece of it.

nxobject · 2026-04-17T00:07:54 1776384474

I love this. From reading the nuts-and-bolts "parameters" (haha) of your implementation, I get the impression that the fundamental limit is, well, using a 32-bit platform to address the sizes of data that usually need at least 48 bits!

hammer32 · 2026-04-17T11:31:02 1776425462

Thanks! The precision was a happy surprises, HyperTalk uses Apple's SANE library, which gives you 80-bit extended precision. The interpreter speed and the lack of arrays were a challenge. Rediscovering what HyperCard could do was half the fun of this project.

gcanyon · 2026-04-16T14:50:46 1776351046

It's strange to think how modern concepts are only modern because no one thought of them back then. This feels (to me) like the germ theory being transferred back to the ancient greeks.

hammer32 · 2026-04-16T14:59:32 1776351572

Right? Backprop was published in 1986, a year before HyperCard shipped. Attention is newer, but a small model like this was buildable.

jeffbee · 2026-04-16T19:00:17 1776366017

People did think of many of these core concepts decades ago, but they did not have the resources to put them into practice.

anthk · 2026-04-16T15:56:51 1776355011

Lisp is from 1960's and with s9 you can do even calculus with ease, in an interpreter small enough to fit in two floppies.

On the Greeks, Archimede almost did 'Calculus 0.9'.

kdhaskjdhadjk · 2026-04-16T16:26:16 1776356776

I think it's incredible to see the potential that is still locked up in old hardware. For example the 8088 MPH demo. Amazing what he was able to do with an 8088 and CGA. All this time the hardware had that potential, but it took decades to figure out how to unlock it, long after the hardware was considered obsolete. Imagine the sort of things that might be done later down the road with hardware of 0-20 years ago if somebody really dug into it to that level.

ashleyn · 2026-04-16T16:44:38 1776357878

Retro console homebrew and demoscene are all about this. There's a lot of fun stuff going on in N64 homebrew right now: https://www.youtube.com/watch?v=rNEo0aQkGnU

anthk · 2026-04-17T07:28:36 1776410916

On the N64, an equirectangular viewer a la QT3D or the current street view is not precisely a wonder.. m68k's could do that at a similar resolution. It's simple 3D in the end.

For the rest, yes, it's really astounding until you push these polygons while moving around in a game loop...

qingcharles · 2026-04-16T19:48:46 1776368926

8088 MPH demo is revolutionary. I have a plan to try and backport the developments from that demo, plus other optimizations learned in the last 40 years, back into the original 8088 Elite PC version. I had Gemini Pro write a PoC using 8088 assembler to create a CGA flat-poly renderer for the ships, which worked great. Next step is to use Claude to disassemble the original Elite binary so I can figure out where the rendering code lives and try to start patching it.

andai · 2026-04-16T18:50:27 1776365427

8088 MPH: this one, right?

https://www.youtube.com/watch?v=yHXx3orN35Y

tomcam · 2026-04-16T16:56:43 1776358603

That 8088 MPH demo is a tour de force. Which tells you that the millions of Apple laptops being bricked right now instead of being recycled could have some amazing use if it were possible to wipe them clean and reuse. Sigh.

andai · 2026-04-16T18:49:37 1776365377

Well, we've set it up so the survival of employees and their families is tied to old products being bricked.

anthk · 2026-04-17T07:27:12 1776410832

They created a better Pacman for Atari 2600, better Outruns for Amiga's and whatnot.

tty456 · 2026-04-16T19:40:24 1776368424

Where's the code for the actual HyperCard and building of the .img? I only see the python validator in the repo.

hammer32 · 2026-04-16T19:49:17 1776368957

The stack is the code. You can view it directly for each button or examine the per-page script. As far as I know there isn't a compiler that lets you write standalone code and turn it into a stack. The stacks are dropped into Disk Copy disk images to preserve their resource forks. Both modern macOS and Git both strip resource forks, so the disk image is the only reliable container for distribution.

tty456 · 2026-04-16T20:17:05 1776370625

So a hypercard is compiled machine code of button clicks and key presses? Weird. I guess that could be macro'd somehow

hammer32 · 2026-04-16T20:27:15 1776371235

HyperTalk is an interpreted scripting language. The scripts are stored as plain text inside the stack and interpreted at runtime. It's kind of like a Visual Basic form where the UI and the code live in the same file. You can open any script, read it, edit it and immediately run the newly edited script.

rcarmo · 2026-04-16T21:24:56 1776374696

Neat. Looks like I found my new benchmark for my ARM64 JIT for BasiliskII :)

(still debugging it, but getting closer to full coverage)

hammer32 · 2026-04-17T11:36:39 1776425799

The training loop is heavy on floating-point math and string parsing, so it should exercise the JIT nicely. I'd love to hear how it performs!

immanuwell · 2026-04-16T18:40:26 1776364826

The architecture of macmind looks pretty interesting

hammer32 · 2026-04-16T19:10:18 1776366618

Thank you! The constraints made it interesting. HyperCard doesn't have arrays, so the entire model, weights, activations, gradients, is stored as strings in hidden fields. All of the matrix math is done with "item i of field".

DetroitThrow · 2026-04-16T14:59:53 1776351593

This is very cool. Any more demos of inference output?

hammer32 · 2026-04-16T16:40:04 1776357604

Thanks! The quickest way to try it is the HyperCard Simulator link someone just posted in this thread: https://hcsimulator.com/imports/MacMind---Trained-69E0132C — go to the Inference card, click New Random to fill in 8 digits, then click Permute. The model predicts the bit-reversed permutation of all 8 positions. The pre-trained stack gets all inputs correct.