Skip to content

Category Archives: Digital

My life as applied to tinkering.

Radio silence

enthusiastic emoticon

Я наконец-то получил позывной. Боже, сколько они мусолили эти четыре страницы с двумя подписями…

Позывной — это тоже имя, причем обладающее следующими интересными свойствами:

  • Оно почти наверняка гугль-уникально.
  • Оно абсолютно ничего обо мне не говорит, кроме того что я знаю где на радиостанции кнопка PTT. :)
  • Оно короткое и хорошо смотрится в виде OpenID.

Мне давно уже надо было несколько развиртуализироваться — сделать из этого блога популярный сайт естественно, невозможно1 и не нужно, а сделать что-то что можно будет найти в гугле как Единственную Истину — вполне реально и давно назрело.

Поэтому этот блог переехал на rn3aoh.net — прошу любить и жаловаться. Сейчас dns прочухается и я сниму ограничения поисковикам…

  1. Кому я нахрен нужен?! Людей читавших его целиком можно пересчитать по пальцам одной руки.

Freevo + Firefox

thoughtful emoticon

It’s just an idea so far. I don’t have the time to devote to polishing it, unfortunately.

As you might know if you’ve been reading me, I’m an adherent of Freevo, which runs on a machine assembled specifically for the purpose of watching media on a TV, which doesn’t even have an antenna plugged in these days.

What I thought of today is making a Freevo plugin to correctly handle Firefox running underneath it. There’s a lot of reasons to have a web browser in a set-top box, and the TV remote I’m currently using has more than enough buttons to deal with it. The biggest problem is overscan. Due to the way analog TV works,1 TV signal must include fairly wide margins on the edges. Without overscan, you see black bars on all sides of your video, which isn’t very nice. With overscan, running Firefox on a TV screen would involve either some serious mess of window positioning2 or living without scrollbars, not to mention losing about 10% of the window surface, making a lot of pages completely unreadable.

Today it occurred to me, that I could, in fact, create a theme for Firefox optimized for TV display — as far as I know, Firefox theming allows for control fine enough to ensure margins to account for overscan. In the same manner, normal Firefox customization would allow me to get rid of all the button bars and control it exclusively through keystrokes simulated with the remote, making the most of the limited screen real estate and low3 resolution.

This really should be investigated, it’d be useful.

  1. I don’t have the money for a modern TV anyway, nor a serious desire to own one.
  2. I’ve been there and I don’t want to go there again, honestly, it involves some very unpleasant scripting.
  3. I run my Freevo at 800×600. Overscan cuts the effective display surface down to about 700×550, give or take a few pixels — depending in real time on the overall brightness.

Афоризм получился

enthusiastic emoticon

До тех пор пока речь идет о языке, каждый человек суть черный ящик. У некоторых этот ящик пустой.

Turing testing in the uncanny valley

angry emoticon

As I have mentioned before, I am not happy with the current state of artificial intelligence research. I think it’s going off on a tangent straight for perfection higher than that observable in human beings, or things entirely unrelated to the practical applications, pushing back results which could be useful right now.

What are the natural language communications researchers doing? They’re trying to beat each other competing for Loebner prize, in which all conversations revolve around trying to aggressively determine whether your conversation partner is a computer. So people ask them about hobbies and movies… Tell me, if you’re talking to a helpline consultant, do you ever want to know what his favorite movie is? Do you really want him to be more sensitive? No, you want him to get the job done — understand what your problem is, ask you questions about it and then tell you what to do to fix it. As a result, most of the contestant programs are practically useless — that is, can’t work in the real world to perform any useful function. They try to train them on the web, where over half of the denizens can’t read, much less write. Then they try to have cybersex with the poor bots.

What are the speech researchers doing? They’re working on fancy animated 3D characters which fall straight into the uncanny valley. They do more and more precise animation of vocal tract, but their renders still either look like extremely ugly cartoons or take way too much computrons to draw anything beside a Gollum for a movie. The only time when they remember that 2D animation still exists is when they want to try fancy morphing algorithms to cheat and use a set of real mugshots.

What are the android kinematics researchers doing? They’re teaching their robots to dance, because it looks good on stage. When they should be teaching them to maneuver in a crowd, bring objects and take objects away. And push around a wheelchair.

At least the visual and speech recognition people are still doing something that makes sense… They aren’t having as much progress as I’d like, though.

It all shows. I’ve been trying to rebuild Rei on modern hardware literally for years — since about 1998. She did what she was meant to do with technologies which were decades old — ELIZA-based AI, which didn’t go far from it’s 1966 predecessor, a Klatt speech synthesizer which was initially introduced in 1970 and last updated in 1995, all running on an 8-MHz processor in a megabyte of RAM.

I still can’t. I had to get another Amiga just to restore her. Why?

  1. Pursuit of more realistic speech synthesis results in concatenative synthesis being in fashion, which makes creation of new voices prohibitively hard. Ohwell, I’ll live with the voices I get, for now, at least until I can get a real vocalist to record a speech corpus, but…
  2. …the only currently available open source speech synthesis package that actually works, Festival TTS is so undocumented that I have not known that I actually can extract phoneme information from it to make it lip sync up until yesterday — while it had the capability to do that since 1996 at least. Which brings me to…
  3. …trying to get at the data. I just want to connect to the synthesis server and get it to send the packet with phoneme data to the client so I can process it. Turns out I can dump it to disk but can’t actually get it through the server connection. Why? Because the script function to do that is horribly broken, leaving me wrecking my head trying to code around it. And along the way…
  4. …I discovered that it creates a security hole on my system the size of a small bus, which nobody noticed for at least five years. Easy to plug one, too.

Argh! I just want to duplicate what I did with hardware and software from aeons ago — at least, aeons in terms of computing history! Back then it was simple, it only took me a week to get there from pretty much absolute zero in the field.

I just want a talking anime character, what the hell is wrong with that?

P.S. Well, what do you know, as soon as I complained on the mailing list, it decided to embarrass me and actually work. I’m actually done messing with Festival and on to writing a Python client class capable of sending a string to it and returning with phoneme and wave data in a single structure. Then it’s on to viseme lookup table and actual talking head in pygame, then the AI — I’m thinking of abandoning my antiquated Eliza-like code and using PyAIML instead. A Python interface to the Velleman board I assembled a few days ago and my most insane toy yet is going to overshadow her former glory. :)

This calls for a new android name. As much as I love Rei and Dorothy, it’s time to pick a new one. Any suggestions?

Velleman K8000

enthusiastic emoticon

Velleman K8000 - assembled and working

Boy, assembling this thing was a pain. No wonder I’ve been putting it off for months — I didn’t really expect it to have 30 jumper wires alone. It’s easy, even trivial to build, and instructions are quite obvious most of the time, but the work is very, very tedious — at least five hundred solder points.

Looks very enjoyable when it’s finally working though, and requires no tuning or anything. If you assembled it correctly, it Just Works.

When I’m done settling it into it’s proper place, Rei will finally be getting the full run of the house. :)

Garbage collection

dejected emoticon

My addressbook contains about 500 different contacts.

The great majority of them come from periods when I was far more socially compatible, and dates 10, 7 and 5 years ago. Naturally, most of them are horribly outdated and I don’t even remember who all those people are.

The annoying absence of sensible addressbook software for Windows beside Microsoft Outlook, which I’m not willing to use (WTF?!) makes the process of managing this junk quite cumbersome indeed. I guess that’s the reason why people still use paper business cards, and why there are scanners for them. I do wonder though, why all those people never write a note on the back telling what exactly it is that they do — I still have business cards for which I don’t remember why I have them anyway.

Even more contacts result from e-mail conversations which I never continued past the first message, and these often don’t even contain a name.

Definitely garbage collection is in order…

А не сменить ли мне мобильник еще разок…

thoughtful emoticon

Этому есть некоторое количество разнообразных причин.

  1. Заведя себе Asus R2H я в очередной раз уперся рогом в необходимость правильного софта синхронизации телефона с компом. Просто если раньше мне надо было делать это непременно с линюхом, непременно с завром, и вообще кроссплатформенно и через абсолютные ебеня, теперь можно обойтись синхронизацией с Thunderbird/Sunbird, абы чем, и мне, в общем, хватит. Однако, грабли с софтом наблюдаются и здесь, и возможно, другой телефон решит эту проблему эффективнее, даром что PIM телефон у компа уже год как прочно перехватил — может есть получше аппарат. Помощнее немножко…
  2. Siemens продала отделение мобильников BenQ. Последний окончательно обанкротился, в связи с чем телефон окончательно отправится на свалку истории к кавалеристам с лошадками примерно через полгода максимум.
  3. …ну и наконец, в прошлый раз смена телефона произошла почти непосредственно перед тем, как я получил некую СМСку от контакта давно забытого, и начала интересную историю. Она хоть и печально закончилась, но вполне вероятно, эти веховые события как-то были связаны.