Spam

Spammers are nasty little pieces of work.

It’s been a constant cat-and-mouse game where we (anti-spammer people!) take a few steps forward, then the spammers hit us back twice as hard.

This time, they’re doing some kind of distributed dictionary attack. So that means that thousands upon thousands of computers across the globe are all trying to send mail to various mailservers (including one I’m responsible for) looking like “joe@domain.com, jack@domain.com, jeb@domain.com, jorge@domain.com…” for several domains that we host.

The problem is – they are slamming the servers so hard that they’re starting to overpower the DNS blacklists we use to block spammers. And they’re not showing up in the blacklists always.

So my idea was to find out when someone fails to send mail to 5 or 10 accounts in a row, and then add them to a blacklist. I wrote a simple PHP script to do that, and it works…eh, okay. Not stellar. I even added in a piece that kill -9’s their smtp process when they get listed, it doesn’t always seem to work right. Maybe they’re coming in 20 times at once, or something.

So I’ve run my little blacklister script for a while – and as of press time I have about 5100 IP’s in my block list. And it doesn’t really seem like it’s getting any better. I finally turn on ‘record entire SMTP conversation’.

So this is what they’re doing –


HELO IMASPAMMER
MAIL FROM:<somelikelyinnocentvictim@somerandomdomain.com>
RCPT TO:<joe@domain.com>
RCPT TO:<jack@domain.com>
RCPT TO:<jeb@domain.com>
RCPT TO:<jorge@domain.com>

To which it gets answers like:


451 No such user 'joe@domain.com'
451 No such user 'jack@domain.com'

etc.

So here’s the clever bit – then they do:


RSET

Which apparently just ‘resets’ the SMTP communication, and start again to do the next five recipients. Ugh.

So now it’s time to dust off the ole C coding, and I’ve rewritten the ‘rset’ command to now say:


502 Just send your mail again, don't pull this RSET garbage.


And disconnect ’em. That seems to have helped a lot – with the spammers having to reconnect, they get a second chance to get looked-up in the blacklists, or checked against on the my own custom blacklist. Load is reduced – though not eliminated. I guess we’ll see how well it works.

My next thing will be to augment this username-check with a counter, and if the counter goes about ‘n’ bad lookups, bounce the connection. That could help as well – but I don’t think by as much as what I’ve done so far.

Browsing Nirvana – Achieved?

So as anyone who has worked with me since around 1995 or so knows that I am a notoriously heavy browser of the web. Since tabbed browsing came out, I have been using a complex two-level hierarchical system to manage my web pages. I’ll have a window that has a general sort of topic – like maybe a web page I’m developing – plus several tabs for some php functions that I’m using or MySQL documentation or whatever. And several other windows set up similarly – sometimes ‘singletons’ for various links I’ve clicked off that people have AIM’ed or Twittered to me. The end result is I can never find anything, and when my browser crashes, I’m screwed. This is why I’ve been so excited about Google Chrome coming to Mac, and about Stainless browser for Mac. Which I still play with, and is getting better every week.

However, I think I’ve just made a change that might have switched up how I use them all. I’ve added a third layer of hierarchy using Fluid. Fluid lets you make little Site-Specific Browsers (SSB’s) for websites you keep open all the time. They show up on the dock as separate applications with their own sets of windows. Basically indistinguishable from a regular Mac OS X application. So here’s how it’s made a difference for me. There are certain sites that I keep open all the time, and certain sites that I’m just browsing and not finished with (hence the window staying open). The ones that I keep open all the time I’ve made little SSB’s for, and closed their windows within my main Safari application (Safari is today’s browser of choice, I switch back and forth from Stainless lately). Now, when I’m trying to find something that’s in one of my always-open applications, it shows up in the dock. I can command-tab to it. Once I’m in it, I can command-squiggle (tilde) to the correct window. Anything that isn’t in one of my always-open applications is in my regular Safari, which only has 5 windows of its own to flip through.

It may sound insane, and probably is, but now that I have this new third layer of hierarchy I feel like a great weight has been lifted. Whereas before I would have to go through ruthless window-culling rampages – “Seriously, I’m not going to do anything about this thing I’ve been sent, I’ve left this window open for 3 hours, let’s accept I’m going to do nothing here and close it” – now I don’t need to, because I can get to everything I need. Furthermore, as a bonus (though I haven’t seen it in action yet) I should have some level of crash-isolation – it should hopefully only knock down one of my SSB’s, and not everything. We’ll have to wait and see how that turns out to be.

I’ve tried Fluid before, once, and it didn’t stick. This time, I still have one main problem – cookies won’t pass between SSB’s and/or Safari. For most people this may be okay but it’s annoying for me. Not a huge deal, just annoying. The other thing I did was spend a full 30 minutes or so making sure I had identifiable icons for my SSB’s – this has helped IMMENSELY. Why they haven’t set up a protocol for this that just requests the icons from the websites is totally beyond me, but, whatever, I just did it and it looks…mediocre. Which is good enough for me! I even made a little icon for my own web application that I run all the time.

I shall report back with how it goes, but it really does feel like a huge weight has been lifted from my shoulders right now.

Rails Documentation

Is the worst fucking thing on the planet. I’ve actually googled for stuff, clicked on it, and gone to redirecting cybersquatter pages, it’s so goddamned bad. Maybe I’m spoiled. The bulk of the professional development I’ve done has been with PHP, though I was pretty heavy into Perl, Tcl and other such languages at their time. Compared to any of them, Rails documentation is, hands down, the absolute worst.

Half the time I feel like they’re being too goddamned clever for their own good. But the ‘sensible defaults’ that they espouse aren’t documented anywhere, so how the hell am I supposed to know what they are? What seems sensible to me might not be sensible to you. I’ve found myself drilling down into source code more times than I’d like to count to try and figure out what’s going on. That is total and complete fail. It’s lucky that it’s so powerful and cool regardless, or I would’ve left it in the dust a million years ago.

Maybe I have to be more…loquacious in PHP. That’s fine. At least I know what to do and how to do it. 70-80% of the time I’m working in Rails, I have no friggin clue how to tell it how I want to do something. Then when I find out, it’s always something like – type two magic words into the right file, then Rails reads your mind. Awesome. I just hate that sickening feeling during that not-20-to-30 percent of the time. I feel helpless.

Then when you do find documentation, it’s all stories. “So here’s what active record aims to do, here’s different ways you can make it do things, blah blah blah.” I like my programming docs terse. I look it up, it tells me what that does. But the documentation, especially, just seems all jumbled together and awful. Or the other thing I’ll find is the opposite granularity – “Class Foo::Helper::Doodad::fwipple::dingus has methods ‘get’,’put’,’set’,’be’,’execute’. The source code to method ‘execute’ is: …….” That doesn’t help either. That’s why it’s called DOCUMENTATION. Not fucking SOURCE CODE. I feel like it’s some kind of ‘hipster’ framework – if they actually explained it to you, and regular unhip people “got it”, then the hip people would all switch to using Scala.

And, embarassingly enough, I only just ‘got’ the yield command in Ruby. That’s just sad, man. Though I don’t see the difference between a yield and an anonymous function, but I guess I’m just not that bright.

I assume it’s one of those things where as soon as you buy into it 100%, completely, and spend time just soaking in it, then you’ll fully understand. But I don’t like having to commit to that level of buy-in. I’ll continue to fiddle with it, and even choose it as a framework in whichever contexts it seems right for, but I’ll always look slightly askance at it – perhaps until I’ve been so steeped in it that I can’t look at it objectively anymore. But until then, fix your fucking docs Rails, it’s horrible.

divs vs. tables, part II – the compromise (maybe?)

<div class=’tablesque’>
   <div class=’rowesque’>
      <div class=’cellish’>A</div>
      <div class=’cellish’>B</div>
   </div>
   <div class=’rowesque’>
      <div class=’cellish’>C</div>
      <div class=’cellish’>D</div>
   </div>
</div>

stylesheet:
.tablesque { display: table; }
.rowesque { display: table-row; }
.cellish { display: table-cell; }

There – it looks like a table, because you told it to look like a table in the CSS. But the markup doesn’t say it’s a table – it just says you have a hierarchy.

I sorta fell into this idea because I’m working on making a web application work for iphone or for a regular browser, and in the plain browser context I wanted something to be a table, but on the iphone, I wanted it to act more like spans and divs.

To give you an idea of what a moron I am, you should know my first idea was to have a big table, and on the iphone, do things like: display: block, display: inline, etc. But the iPhone (and even Safari on the desktop) had problems with letting me convince it to display tables as non-tables. So finally I switched it to divs, and made the regular browser side do display: table, display: table-row, display: table-cell. And that seems to work okay for now.

So, standards people, there, I’m standardy. My ‘layout-like-a-table’ CSS is all in the CSS. I think this CSS looks a hell of a lot prettier than the crazy ‘float, clear, width, etc’ routines. And it should stretch better based on its contents á la tables.

As a bonus, within the table DOM stuff I don’t have mysterious invisible ‘tbody’ tags that chuck themselves in my table. I lost 3 or 4 hours to that a while ago.

Bravo, MS!

Never thought I’d be saying that…

But I ran into a couple of different intertube posts that talk about the new MS ad campaign that says “Macs=expensive”. (Here’s Engadget’s.)

I’m a huge Mac lover, and were technology company/human marriages legal, perhaps I would’ve married it (sorry Nicola…). But they can actually make a valid, salient, understandable point here, so more power to ’em.

Now, the real point here is the value for what you get – yes you can buy a computer for $1000 or $5000, the same way you can buy a car for $10,000 or $50,000, it depends on what you’re looking for and what value you’re getting. But, that’s a complicated argument and isn’t going to compare to: “Teh Macs are Expensives!” There’s also an undercurrent of “Macs are for latte-sippers!” and that’s pretty subtle, and also valuable.

I kinda feel like Apple has grown a little…comfortable, perhaps, lately. So I like the idea of MS really breathing down their necks to keep them from becoming too complacent.

I mean, we are in a down economy, letting people buy cheaper stuff becomes a good idea.

Most of MS’es advertising attempts have either left Apple completely unmentioned, or been just completely pointless. This is the first one that actually seems like it has a message, and could cause a little motion in the marketplace. Good on ’em. About time they did something right.

Now let’s see Apple’s response where they come out with some more ‘everyman’ style pricing.

Facebook Connect (FBConnect) And XHTML

So I’ve dabbled in Single Sign-On solutions a couple of times before, and none of them have struck me as particularly usable. They all function, certainly – but the amount of technical acumen required to try and get them up and running; or to actually use them tends to be too great.

So Facebook has been around for a while, they have a rather widely-used API, and now they’ve announced “Facebook Connect” – a single sign-on solution that uses your FB credentials to log you in to an external website. Nice.

So, on a lark, and borne of my extraordinary laziness and lack of desire to input yet-another-set-of-credentials-into-another-form, I added FB Connect to a personal app of mine. It only took me a little bit of an evening and an hour or so in the morning. I was surprised. It works pretty well. I didn’t have to totally overhaul my application – which is always nice. One thing I found that was very weird was that everything is asynchronous. You do all your work in Javascript. I would expect to do code like – “okay, facebook, make sure this guy is logged in. Then, do this other thing, and so on.” That’s not how it works, it’s all callbacks, because it’s all done asynchronously. You instead say, “Okay, facebook, make sure this guy is logged in, and as soon as you’re sure he is, do this whole big other thing for him.”

It’s an odd way to program, but probably the absolutely right way to design this API. Very AJAX-friendly. Okay, we’ll come back to this in a minute.

I’ve been working on some web stuff lately – both experimenting and trying to do real useful stuff – in XHTML5. I’m tired of people saying that certain things aren’t possible, when in fact, they are; they’re just really hard. So even though I myself have taken XML (and, by extension, XHTML) to task, I thought I’d give it a try.

I even did the super-nerdy “Serve as application/xhtml+xml to those who can read it, text/html otherwise” and that even worked. I did some CSS stuff that IE can’t read, but if you’re using IE, I don’t care about you that much. I’ve even tried to make it so the XHTML contains no ‘presentational’ tags – like div’s for forcing breaks, etc. And it looks kinda crappy, but I’m still working on it. The Content-Type trick I used didn’t work at first, but I got it to work all of the sudden and I watched my pages break right before my eyes. It really just helped me find missing close tags, I didn’t find it all that jarring.

The whole thing got me to thinking – it really seems that XHTML 1.1 and 2.0 are the real ‘problem’ here, and that XHTML 1.0 may actually be a decent standard. It seems to work OK. And the idea of mixing tags together from both XHTML and another namespace in order to get the best of both sounded appealing – and yet, that’s precisely what the Facebook Connect API does! It creates “XFBML” which turns into facebook-branded login buttons and so on. So I thought I’d give a quick look to my just-recently-built FB Connect app and see if I could do the same XHTML nerd thing I did before. Seems pretty obvious – I mean, even the <html> tag has namespace attributes for both xhtml and xfbml, it sounds just like this stuff is real XHTML and it should just work.

*****BZZZZT!!!!**** WRONG! Big fail. It does not work at all. It seems to be dependent upon HTML javascript rules. One of which I was able to work around, the other; they will have to. And there may of course be other problems buried in there that I can’t see yet because it doesn’t yet work. Hell, I even copied it to my website and tweaked my local copy and am still having trouble getting it to work. I got to the point where I am trying to de-obfuscate code, so I think that means “I lose”.

So this is a very weird world we’re now in – here’s one of the highest traffic websites in the world (fifth most popular at the time of this writing according to Alexa), and they’re encouraging people to write this hybridized, weird semi-xhtml/semi-html markup. The thing that I find most impressive is that it even works at all, to be honest.

And it does work, and works well. And lots of people are using it, as far as I know, but I haven’t heard anything about its general unusableness in XHTML. It just makes me wonder, is all. Is validity really important? Certainly, it makes sense to define a language and have everyone try and implement it to that standard, but going nuts about valid this and valid that does seem a little…I dunno…disconnected. Or, more likely, the Web Powers That Be have dictated that XHTML = dumb, so people are just making stuff in HTML4.01 or HTML5, hence no one noticing that FB Connect is totally broken in real XHTML mode.

I think Facebook cares about what *works* and doesn’t give a shit about what’s *valid*. And I think the standards committees have historically cared about what’s *valid* and haven’t given a shit about what’s *worked*. And I am willing to bet that somewhere in the middle is where we should be.

The irony here is that the feature I despised most in XML – namespaces – *could* find its way into HTML5 (Hey, FB is using it!). And HTML5 is really a reaction to some of the more horrible missteps made in XHTML 1.1 and XHTML 2.0. Which is the primary form of XML on the web. I’ve got to admit, though, it does certainly show a somewhat pleasant way to extend HTML – the HTML stuff goes in its namespace, the crazy crap you just invented goes into your own namespace, and if I can’t read your crazy crap, I won’t display anything. Deal? Deal.

Edit – Stupid Blogger preview doesn’t match what you post. Whoops. Added back those pesky carriage returns.

Another new Braydix

http://desk.nu/Universix.iso

Now it's starting to get out of the proof-of-concept phase, and into the alpha release stage, almost…

Features:

  • SSL
  • Actually loads images (nice, right?)
  • Should have better VESA/VGA support.
  • (some testing has worked with booting via USB thumb drives, but I don't think this ISO will. Maybe that means I can’t count it as a feature? I dunno.)
  • WIRELESS – using the familiar and intuitive Ctrl-Alt-F2, wpa_gui -something_something, ctrl-alt-F1 method – how easy!
  • Almost installable – if you already have a formatted and correctly
    labelled ext3 partition, you can copy it to your HD.
  • Will run in 128MB, maybe even as little as 64MB.

Uses:

  • Security dorkwads – bad men can't 0wn what isn't 1nst@ll3d
  • Kiosks dweeblings – no GUI other than a browser
  • Old, decrepit POS computer-hoarders – this will actually run on your crappy computer
  • Remorseful netbook owners – this will actually let you do something useful, like checking out Facebook, on your otherwise useless would-be laptop. This is my current use case.
  • Bloat warriors – next time you see a completely horked Windows XP installation or something even worse limping along painfully, know that with 120mb of storage, and 128mb of ram, you could be doing soooo much better…

Unfeatures:

  • Doesn’t have a pleasant installer. Won’t format your drive (though, this could be a plus…)
  • No bookmarks – c’mon, isn’t that what Google’s for?!
  • No disk cache
  • Doesn’t utilize the memory it saves (yet)
  • Doesn’t save wireless settings (by default)
  • Disk image is more bloated than it should be
  • no sounds?
  • No Flash (this is a problem…but solvable?)
  • static IP’s aren’t yet supported (DHCP only)
  • Nasty way of invoking the wireless configurator – and it’s not that friendly

So if 99% of what you do for fun or for work ends up with you browing the web, why have your computer do anything else?

Pets make you stupid

My good friend, former co-worker, and current client Beckley once told me that having a pet makes you stupid, which he thought explained what he called his cat (which contained, if I recall correctly, “Chairman Miau” and “Monkey”). edit – the neighbor’s cat was Chairman, Beckley’s cat is The Monkey. He documented the slow and horrible transformation of his cat’s name on his blog.

I am afraid he is completely correct. These are the names I have used for my wife’s dog:

  • Lucy
  • Lucy Loo (Lucy Liu?)
  • Lucy Doops
  • B. Doops

So we’re now to the point where we refer to the dog as B. Doops. It’s not even as if I know what the B stands for. That’s just what she’s called now. I may have mispelled it – it could be Bedoops or something. Who knows.

So be careful. If you get a pet, it will make you dumb.

Edit – Lucy Loob, Luby Doo, and Luby Doob probably gets us closer to “B. Doops”.

Watchmen Review

So after walking out of the IMAX that was showing the Watchmen, I didn’t know how I felt about the movie. I twittered that I had seen it and that going out of one’s way for the IMAX version was not worth it. But did I like it? The following contains no spoilers, because I fucking hate spoilers and had to keep plugging my ears and singing “LA LA LA LA I CAN’T HEAR YOU” whenever anyone tried to tell me anything about it.

The answer is, that yes, yes I did. I really only realized what was going on in my head when I saw the last half of Spiderman today – which I think I’ve seen like five times..

Y’see there are Superhero movies like Spiderman, X-Men, and Iron Man. All of which I truly enjoyed. Interesting characters, good plotlines and subplots, great action sequences, little funny bits – all around good stuff.

But the Watchmen is not that kind of film. It’s closer to a Sci-Fi film, or even some character driven piece like Tarantino’s work. Sure, there’s some plot in there as to what’s going on, and there are some cool fight scenes, but that’s not what the movie’s about. It’s got that Sci-Fi “what if…” feeling about it, and also asks “what would hero types actually be like if they really were in the contemporary world?”

And getting those two types of movies mixed up is my mistake. This movie isn’t that movie. It’s very good, and interesting, but it’s not an action movie, and it’s not a superhero movie. Keeping those things straight makes it easier to think about and realize I enjoyed.

But if you’re not the type of person who wants to watch a movie that you could walk out of and wonder whether or not you liked it, then I wouldn’t watch this one.

But if you’re okay with that, I think the film is very good – and based on my brief readings of the graphic novel, pretty close to it.