Thursday, July 23, 2020

Why does everyone Care who's Jewish?

I actually wrote this over 2 years ago, but just found it in my drafts. I think I might have tweeted about it though. When formatting this as bad HTML, I noticed some mistakes in the ratios, so I fixed them. I assume the data is still good. Anyone can try to verify if they're interested.

Anyway...:

I've wondered for a while, based on probably flawed confirmation bias, that Wikipedia might have a bias of people wanting to know if other people are Jewish.
My brain made the connection that the "early life" heading was really the "is this person Jewish?" heading.

So, I've done some Google-fu to try to get a reasonable answer, the lazy way.

Search terms vs results on Wikipedia:

Search terms"Jewish"-"Jewish"
"Early life"1.08m14.2m
-"Early life"61.6m866m

That's a ratio of 57:1 of Wikipedia pages mentioning the term "Jewish" having no Early Life section to those that have an Early Life section, and 61:1 for non-Jewish-mentioning pages. That's probably not statistically significant. Let's assume it's not.

Ok. That's not too bad. Oh wait... That also includes lots of pages that aren't even about people.
Now let's filter out some noise by adding the search term "living people" which is a category for... living people.
Search terms"Jewish"   "living people"-"Jewish"   "living people"
"Early life"53.3k141k.
-"Early life"217k12.3m

That's a ratio of about 4:1 of Wikipedia's "living people" pages mentioning the term "Jewish" having no Early Life section to those that have an Early Life section, and 87:1 for non-Jewish-mentioning pages.

So, what's up with that? People really seem to want to know if people are Jewish.

Friday, July 3, 2020

Prank Defeated - Formerly: Introducing Inference: A Neural Network Powered Programming Language

OK, a few months ago, I wrote this post, as a bad joke, and scheduled it to be published on April 1st, 2021. Since then, I've had a chance to play around with OpenAI's AGI playground, and while I've only produced some fairly funny interactions and other basic uses, other people are doing some crazily impressive stuff with it, like this:



So I'm posting this thing early, because why not, but instead of it being a prank, it's more a prediction of the future.


Inference has been created to define a new paradigm of programming languages.

Where all previous languages required programming using a rigid syntax, where semantics were precise, and no leeway existed, Inference uses the power of neural networks to infer meaning from the programmer.

The basic model works like this:

Programmer writes code however they want

For example:
  1. var x = position + 8
  2. add a margin of 8 to the position - keep track of that for me (call it x)
  3. let a new variable (let's say x) hold that variable I just used, plus the fixed margin

Training

The neural network ingests the code that's been written, and based on its assessment of certainty of each statement (taken individually, and in a sampling of broadening contexts), quizzes the programmer for meaning, generating educated guesses about the meaning, as well as allowing the programmer to correct it.

For example:
Regarding let a new variable (let's say x) hold that variable I just used, plus the fixed margin, did you mean:
a) int x = position + fixedMargin  [92% certainty]
b) int x = position + [?? supply reference to "fixed margin"] [98% certainty if reference supplied]
c) int x = position + top [86% certainty as fixedMargin used after top]
d) [Let me know more precisely what you meant]
As this is a trained system, the more consistently the style is written, the easier it will be to train. That might make it seem pointless, except it is able to store multiple profiles, for each developer, meaning multiple programmers can each use their own style, and train the model to understand what they each mean.

Performance

So far, this is in alpha, but with just 3 days of training on pseudocode written by 3 developers, it has been able to understand, compile and execute a simple Pong-style game, written in plain language by those same developers.

More info

Code and infrastructure config will be released soon.
Tweet at me for more info.