Im making a program for converting Diablo 3 items stats into text from image, i made things work on AH but now i want it to scan the game in real time mode and grab all the item windows that player sees (that requires 3-5 screenshots per second).
I have some ideas on how to make recognizing item windows easier for the machine, as so:
set brightness and contrast of each screenshot to max, this should remove most of gradient places (wich are bad for my goal)
turn screenshot into monochrome image, this should make things ALOT faster, since each pixel may have only 2 states - black or white (please correct me if im wrong)
here are steps that ive made (in paint.net so far but i think it wont be hard to make same things in ImageMagick):
1) Original image:

2) Setting input level to 1 (i dont understand what it does, but it does what i need lol):

3)monochrome:

4) I need coords of these red spots (or at least top left spots, since width is fixed and height is very easy to detect by going from top left corner down till first white pixel):

I think 1 of possibilities is to remove all noise by, lets say, hard blurring image at some point and then monochroming it, but im not sure. But if i manage to totally remove all the black shapes of small size, then i could just search for first black pixels with functions that work with binaries.
Any way if some1 can help me or give an advice or link - this will be much appreciated.
Edit: i dunno why it cutted all the images, here are links to them on image shack:
1) http://imageshack.us/photo/my-images/651/49038602a.jpg/
2) http://imageshack.us/photo/my-images/829/23798932.jpg/
3) http://imageshack.us/photo/my-images/842/14352362.jpg/
4) http://imageshack.us/photo/my-images/688/18924584.jpg/