SwissCenter Forums  


Modded ofdb.de Parser - Fieldtest - 2008/09/24 08:44 Hi!

I have modded the ofdb.de parser to use a different search URL. This works very well for me: I no longer have any titles in my database that are in ofdb but weren't found by SwissCenter.

But that's only for me so far. After a quick exchange with Pernod, he agrees that the change would make sense, though it needs some extensive testing.

Now that's where you come in: if you are using the ofdb parser and want to help, please take the one attached to this post, test it, and give some feedback here.

If we get good results, the changes might make it into SVN.

Kind regards

KMan
File Attachment:
File name: www.ofdb.de_KMan_Mod_20080924.zip
File size:2281 bytes
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
  | | The administrator has disabled public write access.
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 09:39 KMan, thanks for providing this.

I got mixed results:
I haven't received results for the following example:
- Brothers Grimm / Gebrüder Grimm (both spellings tested)


At the moment I don't get anything; maybe I'm temporarily blocked by Google (this does happen if Google thinks you do too many searches), so I'll check later.
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 10:10 upD8R wrote:
KMan, thanks for providing this.

I got mixed results:
I haven't received results for the following example:
- Brothers Grimm / Gebrüder Grimm (both spellings tested)
....


Hi,

first of all thanks for taking part in this field test!

I checked your result here:
Brothers Grimm
works for me

Gebrüder Grimm isn't listed as a title within ofdb.de itself, so that's why it can't get a match.

You can easily double-check the parser by simply entering the search term in ofdb and seeing what matches come up, then maybe adjust the title accordingly.

E.g. searching ofdb.de for Gebrüder Grimm returns:
1. Gebrüder Grimm's Schneewittchen / Snow White (1997)
2. Wunderwelt der Gebrüder Grimm, Die / Wonderful World of the Brothers Grimm, The (1962)

Also, if you set your log level to 6 or above, you can see which Google query is executed; you can then run it in your browser to check whether something usable came up.

Since you are speaking of mixed results, what else isn't working?
And even more important to know: if something isn't working with the mod, did it work with the original parser, or did it fail as well?

Kind regards

KMan
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 10:50 I don't have much time right now but I will do another test later on.

I just checked the URL with log level 6, and after executing the query in the browser I could see that Google had indeed blocked my SwissCenter PC.

I don't know under what circumstances their spam mechanism kicks in, but maybe we can introduce a timer-based workaround?
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
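The timer-based workaround suggested above could look roughly like the following minimal Python sketch. It only enforces a minimum pause between consecutive lookups. The class name `Throttle` and the interval value are assumptions made for illustration; the thread does not say what request rate Google tolerates, and this is not part of the SwissCenter parser.

```python
import time


class Throttle:
    """Enforce a minimum delay between consecutive calls to wait()."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # seconds between lookups (assumed value)
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous lookup, None before the first

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


# Usage: pause before every lookup so a batch refresh is spread out over time.
throttle = Throttle(min_interval=0.1)  # hypothetical interval, kept short for the demo
for title in ["Brothers Grimm", "Aeon Flux"]:
    throttle.wait()
    print("would look up:", title)
```

The clock and sleep functions are injectable only so that the behaviour can be checked without real waiting.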
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 15:17 I don't know why but my SC server is still blocked at Google.

If I manually execute the query from the logs, I get redirected to http://www.google.com/sorry/

Grrr.

If I enter the alphanumeric code I do get the results in the browser but the SC query still fails.

So no more test results from here till tomorrow evening.
But I believe your query works well (from what I could see in the web browser).

BTW, Google recommends deleting any cookies if the problem persists. Does PHP store cookies somewhere, or is it a fully "cookie-free" query?
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 15:23 It seems that your query fully fits Google's spam rule

I just tested from another PC (still the same public IP, though). What do you get if you open the following link:
Barbie
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 15:39 upD8R wrote:
It seems that your query fully fits Google's spam rule

I just tested from another PC (still the same public IP, though). What do you get if you open the following link:
Barbie


This is weird,
I didn't get that sorry page even once so far.
Executing your query I get
Results 1 - 6 of 6 from ofdb.de for allintitle:Barbie Die zwölf tanzenden Prinzessinnen -review. (0.99 seconds), with the desired match as the first result.

While testing I ran lots of queries from SC and from within Firefox in quite a short time, and got no blocking message at all.

Does it make a difference if you use this or this query?
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 16:14 KMan wrote:
Does it make a difference if you use this or this query ?

At the moment not, both work as expected. But I just needed to restart my DSL router, it seems the WLAN stuff is dying

But this is also good news: I just tested it with my small movie collection of 16 movies, and every query is a hit except "Aeon Flux", where it obviously caught the TV series.
So I renamed the movie in the details view and tried a manual refresh (just for this movie) with the button on the bottom of the page.

Well, what should I say? Here's the logfile excerpt:
Code:

2008.09.24 22:09:36 Including parser file C:\Dokumente und Einstellungen\All Users\Anwendungsdaten\Simese\Data\ext\parsers/movie/www.ofdb.de.php
[2008.09.24 22:09:36 Searching for details about AEon Flux Blicke der Zukunft ins Auge online at 'http://www.ofdb.de/'
[2008.09.24 22:09:36 Fetching information from http://www.google.de/search?q=allintitle%3AAEon+Flux+Blicke+der+Zukunft+ins+Auge+-review+site%3Aofdb.de&filter=0
[2008.09.24 22:09:39 No Match found.
[2008.09.24 22:09:39 Failed to access the URL.



Again, I get Google's sorry page. KMan, could you try some manual updates? I don't know why this happens but it's annoying.
On the other hand: how often do I have to update the details? If they are correctly stored in the XML you won't have any problem later on ...
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
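To re-run the query from the log excerpt by hand, it helps to see how the URL is put together. The following Python sketch rebuilds a URL of the same shape (`allintitle:<title> -review site:ofdb.de`, with `filter=0`). The function name `build_ofdb_google_url` is made up for this illustration; the actual parser is a PHP file and may assemble the URL differently.

```python
from urllib.parse import urlencode


def build_ofdb_google_url(title: str, host: str = "www.google.de") -> str:
    """Build a search URL shaped like the one shown in the log excerpt."""
    query = f"allintitle:{title} -review site:ofdb.de"
    # urlencode() percent-encodes ':' as %3A and turns spaces into '+'.
    return f"http://{host}/search?" + urlencode({"q": query, "filter": "0"})


print(build_ofdb_google_url("AEon Flux Blicke der Zukunft ins Auge"))
```

Pasting the printed URL into a browser shows what the parser would see, including a redirect to the sorry page if the IP is currently blocked.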
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 16:17 BTW, when I started writing my last post I could successfully click on both of your links. After the steps described above I fail now.

It seems my public IP is now "blacklisted" again. Strange, isn't it?

I'm not a virus, harharhar
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/24 17:00 upD8R wrote:
...but "Aeon Flux", where it obviously caught the TV series.
So I renamed the movie in the details view and tried a manual refresh (just for this movie) with the button on the bottom of the page.


This is because ofdb lists Æon Flux - Blicke der Zukunft ins Auge and not Aeon Flux - Blicke der Zukunft ins Auge. At least you have a knack for picking the hard ones.


Again, I get Google's sorry page. KMan, could you try some manual updates? I don't know why this happens but it's annoying.
On the other hand: How often do I have to update the details? If they are correctly stored in the XML you won't have any problem later on ...


O.k.
I did 22 manual updates as fast as I could; all succeeded, with no error and no sorry page. I have no idea why you keep getting this.
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
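The Æon/Aeon mismatch described above suggests trying more than one spelling before giving up. Below is a small Python sketch of that idea. The helper `title_variants` is hypothetical and not part of either parser; it only covers a word-initial "Ae"/"AE" versus the "Æ" ligature.

```python
import re


def title_variants(title):
    """Return the title plus alternative spellings for the Æ/Ae ligature."""
    variants = [title]
    # Word-initial "Ae" or "AE" rewritten as the ligature, e.g. Aeon -> Æon.
    with_ligature = re.sub(r"\bA[eE]", "Æ", title)
    # The reverse direction: ligatures spelled out in plain letters.
    without_ligature = title.replace("Æ", "Ae").replace("æ", "ae")
    for candidate in (with_ligature, without_ligature):
        if candidate not in variants:
            variants.append(candidate)
    return variants


print(title_variants("Aeon Flux - Blicke der Zukunft ins Auge"))
```

Each variant could then be passed to the search in turn, at the cost of extra queries for titles that fail.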
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/25 14:27 It looks good. Today I can test it, and if the movie's name is correct it's always a direct hit.



But my movie collection is quite small, i.e. approx. 30 movies!
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/25 14:33 upD8R wrote:
It looks good. Today I can test it, and if the movie's name is correct it's always a direct hit.



But my movie collection is quite small, i.e. approx. 30 movies!


Hi upD8R!

This sounds good so far! My collection isn't any bigger either,
but I already had a few movies in it which didn't get a match with the original parser and were a direct hit with the modded one. And more importantly, I didn't have any movie that worked with the original but failed with the mod.

Again,
thanks for testing.

I begin to wonder if the two of us are the only German users here?

Kind regards

KMan
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/25 14:46 Not sure about the Germans, there are a few guys around like Joe_999 or NX70.

What I'd like to start is a review of the language file. I believe some minor optimizations are possible, but it should be done by more than one person.

For instance, there is "Film-/DVD-Details bearbeiten" and "Details zu TV Sendungen eingeben". This is not consistent, is it? But that's off-topic in your thread
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/25 15:00 upD8R wrote:
...But that's off-topic in your thread

It is indeed,

but for a start you could look at this thread
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/25 15:13 KMan wrote:
upD8R wrote:
...But that's off-topic in your thread

It is indeed,

but for a start you could look at this thread

I see. You've got too much time.

Seriously, drop me a message if you need review support, harharhar
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 04:32 @KMan

Please try the title "2 oder 3 Dinge, die ich von ihm weiß". This is found by the old version but not with your new one.

Sem
Player: Pinnacle ShowCenter 200
Server: Windows XP Pro SP3 (Simese 1.45, SwissCenter latest SVN)
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 04:55 Another one which is caught by the original ofdb parser but not by KMan's:


Alien vs. Predator 2
Original vs. KMan
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 07:16 upD8R wrote:
Another one which is caught by the original ofdb parser but not by KMan's:


Alien vs. Predator 2
Original vs. KMan

Hi,

This is interesting: for me both parsers fail, but maybe that is because of my setup, where the search is truncated after the dot:
Searching for details about Alien vs online at 'http://www.ofdb.de/'.
Nevertheless, my parser fails because ofdb actually lists the film as Aliens vs. Predator 2 in the page title, and since I am only looking at that, it doesn't catch the aka.

You just found the first real failure of the mod.
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 07:23 semko wrote:
@KMan

Please try the title "2 oder 3 Dinge, die ich von ihm weiß". This is found by the old version but not with your new one.

Sem


Hi Sem,
I checked this one and can confirm that it fails too, though for a reason I don't really understand.
Google seems to fail because of the word "oder"; for some reason it doesn't work along with the allintitle keyword. Weird behaviour.

I tried several combinations but couldn't get my search to return a valid result as long as "oder" was in the title. As soon as I dropped it from the title I got an instant match. This surely isn't a solution, though, and it leaves me wondering how to search for "oder" in a page title with Google at all.
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 07:33 Ok,

here's the consequence of what upD8R and semko discovered so far:

I modified the mod.
What I changed is that it first tries to find a title with the new search query, and if that doesn't give a usable result it automatically reverts to the search query of the original parser and tries again.

Hopefully we will get better accuracy than with the old parser alone.

Feel free to test:
File Attachment:
File name: www.ofdb_kman.de_20080929.zip
File size:2430 bytes


And feedback is welcome

Kind regards

KMan
Player : Pinnacle ShowCenter 250HD (wired)
Server : XP Pro SP2+ (Simese 1.45, SwissCenter current SVN)
Spec : Intel Pentium D830 (DualCore) 3GHz, 2GB RAM
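The fallback described in this post (new search query first, original query second) can be sketched in a few lines of Python. This is only an illustration of the control flow, not the code in the attached zip. The two search functions are stand-ins that return a fake result string instead of contacting Google, and all names here are assumptions.

```python
from typing import Callable, Iterable, Optional

Search = Callable[[str], Optional[str]]


def first_match(title: str, searches: Iterable[Search]) -> Optional[str]:
    """Try each search strategy in order and return the first usable result."""
    for search in searches:
        result = search(title)
        if result:
            return result
    return None


def new_query_search(title: str) -> Optional[str]:
    # Stand-in: pretend the new query misses titles containing the word "oder".
    return None if "oder" in title.split() else f"new:{title}"


def original_search(title: str) -> Optional[str]:
    # Stand-in for the original parser's query; always "finds" something here.
    return f"original:{title}"


strategies = [new_query_search, original_search]
print(first_match("Brothers Grimm", strategies))
print(first_match("2 oder 3 Dinge, die ich von ihm weiß", strategies))
```

Keeping the strategies in an ordered list makes it easy to add a third attempt later without touching the loop.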
Re: Modded ofdb.de Parser - Fieldtest - 2008/09/29 07:34 KMan wrote:
Ok,

here's the consequence of what upD8R and semko discovered so far:

I modified the mod.
What I changed is that it first tries to find a title with the new search query, and if that doesn't give a usable result it automatically reverts to the search query of the original parser and tries again.

Hopefully we will get better accuracy than with the old parser alone.

Feel free to test:
File Attachment:
File name: www.ofdb_kman.de_20080929.zip
File size:2430 bytes


And feedback is welcome

Kind regards

KMan

Thanks, need to check it. But my whole small movie base is already properly tagged
2x SC200 (wired), SwissCenter latest SVN, Simese 1.40
http://www.knecht-ruprecht.info