|
|
Welcome to the Invelos forums. Please read the forum
rules before posting.
Read access to our public forums is open to everyone. To post messages, a free
registration is required.
If you have an Invelos account, sign in to post.
|
|
|
|
Invelos Forums->General: Website Discussion |
Page:
1... 8 9 10 11 12 ...26 Previous Next
|
goodguy's Credit Lookup Plus |
|
|
|
Author |
Message |
Registered: March 14, 2007 | Reputation: | Posts: 4,695 |
| Posted: | | | | Quoting mediadogg: Quote: Dunno, but as we have together found things like the uncredited entry in a different locality, and printer control characters in the Overview, people don't always follow the rules! I didn't doubt that the profile existed, I was just curious as to how it came to be, since Profiler will not allow you to enter a name with a blank first name field. I checked, and the only way I found was by pasting the info. But if that's how it was done, I still wonder how that info was created. I don't allow for blank first name in my programs, and I don't know of any other third party program that does. One could edit data manually to enter such a name, but that would mean someone was intentionally trying to corrupt the database. So it's a bit of a mystery, but perhaps not really relevant in this context. | | | My freeware tools for DVD Profiler users. Gunnar |
| Registered: May 19, 2007 | Reputation: | Posts: 5,715 |
| Posted: | | | | The profile was created back then when the DVD Profiler was owned by Intervocative. A long time for bugs to be removed from programs. Data usually has a longer life cycle than programs with their bugs... | | | Complete list of Common Names • A good point for starting with Headshots (and v11.1) |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Well you know as programmers, we are taught that "bad data" is no excuse for a program malfunction. So, the relevance is that somehow we have to handle it. And if we can still provide some value to the user, all the better. So, we use the credit info, even if it was put in the wrong place. Guess we are lucky that some people still know the history of the data. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Ok, accuracy improved. Moved progress bar so that it does not intrude on the UI (no longer stops you from working while it is running). Still chasing a couple of elusive glitches. Hopefully a video and release within a couple more days. I've also got to get my taxes done. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Today: - Automatic refresh of online profile list to ensure most recent and accurate results - Attempting to match CLT title sort order (ignore "a " and "the " at the beginning) (drat, I was about to make it more general, "der/die/das, los/las, le/le, etc. when I discovered CLT sort breaks its own rules, especially with "die". I found it ignored in some cases, but then "Die Geisha" is inserted between "Da..." and "The E...".) - renumber list after any column sort - another case of formfeed ("\f", hex 0C) in an overview. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Today: - Hope to report later - Finally bit the bullet and ripped the guts out for a massive rewrite Alright, I've got the options I want (too many) and enough stability that I can reproduce bugs now, so I had the confidence to run Clint Eastwood. I got 6639 individual credits within 3118 profiles, not counting "uncredited", which if counted, yields 5 more. So, I get max 3123 profiles with credits, whereas the CLT claims 3138. Any suggestions for how I can (easily) figure which ones I missed (assuming the CLT is not lying), to see what kind of error could be causing the difference? Not having the original programmers available for the CLT or CLTPlus limits my ability validate results. Is it my bug or theirs? If I didn't get one before, can somebody give me a link to recent CLTPlus XML download for Clint Eastwood? Now that I can load that into CLTBoss, I can easily run my XML scan for comparison. And using CookTop, I can easily get a count of the profiles. (actually after loading the xml, CLTBoss will also have the profile count) Edit: Just ran Will Wallace. Dead on 377 profiles, both in mouse-click manual mode (5 min) and automatic mode (10 min). I might be pretty close to being ready for a video tutorial and release. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: May 19, 2007 | Reputation: | Posts: 5,715 |
| | Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting AiAustria: Quote: Clint Eastwood CLTPlus Many thanks for that. Of course I have known about how to use the original CLTPlus. I have explicityly chosen not to, because the author is on record as having withdrawn it from the Community. So, I can in all honesty say that, even though it certainly would have been of benefit, I have not used or reverse-engineered any aspect of CLTPlus. Just for the record. But I think I can legitimately process the resulting XML file provided to me by others, so thanks again for that. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting AiAustria: Quote: Clint Eastwood CLTPlus Ok, super mega helpful. I was able to quickly discover and fix a couple of bugs in my CLTPlus Import, and then after running the XML Scan on the 3138 profiles, CLTBoss found credits in all of them. So, the missing profiles was a result of a glitch in the automated scraping, not the credits match. I will spend some more time seeing if I can tighten up the reliability of the automated scraping even more. Manual scraping is pretty solid, but who wants to do 130 mouse clicks? (1 per page plus a couple to get going). The "click-scrape-next page" method I came up with is actually very reliable, and for anything under 20 pages, it is also very fast - 20 mouse clicks (or less) and done. But for 126 pages, I don't think so. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Man I hope you guys are not totally sick of me, but I am determined to get this damned thing right. If I were smarter, it would be faster. Sorry. Anyways, is there anybody that can confirm how many profiles in this XML file have credits for "ziyi zhang" spelled exactly that way? (I know there are 366 profiles in the file. But do I really have two profiles with no valid credits???? If so, which two?) I appreciate the help in advance. I am so dizzy with code variations and watching progress bars ... | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 14, 2007 | Reputation: | Posts: 4,695 |
| Posted: | | | | Just counting them in Editpad Pro I get 330 Ziyi Zhang and 36 Zhang Ziyi. That would seem to account for all 366 profiles. And my CltInfo program (not yet released) gave the same numbers. | | | My freeware tools for DVD Profiler users. Gunnar | | | Last edited: by GSyren |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting GSyren: Quote: Just counting them in Editpad Pro I get 330 Ziyi Zhang and 36 Zhang Ziyi. That would seem to account for all 366 profiles. And my CltInfo program (not yet released) gave the same numbers. Thank you. What I am trying to determine is under what circumstances I get a different result from the CLT, which people consider to be the gold standard. So, yes, while any profile can contain any variant, I want to understand why when I search for "ziyi zhang" (case independent), I get 366, when the CLT says 364. So in this case, I "don't care" that other variants are in the profiles. I can also search for those, and the value of a program like CLTInfo, is that it takes the raw XML and squeezes out all variants, so great, I am not attempting to duplicate that (but maybe I should?). Most of the time, CLTBoss returns dead on the same number as the CLT for a specific variant ... just when I think I'm ready to release, I hit an exception, and when I do, I try to figure out why. My search attempts to be exactly what AiAustria has suggested: for any given profile, first accept a case-independent match on creditedAs, and failing that accept a case-independent match on the concatenation of first/middle/last, ignoring birthyear (unless optionally chosen by the user). The fact that your counts add up to 366 is interesting, but in fact, my code (and I will double check) was actually ONLY searching for "ziyi zhang" - that specific string. So, just trying to figure that out. And then why would CLTInfo not get 364 for "ziyi zhang" - would't we expect the same number as the CLT (assuming that the set of profiles has been correctly collected). Since we don't actually know what the search parameter for the CLT is, and how it is coded, I am taking the description by AiAustria as "gospel." | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Aha, I just found something. The CLT search seems to only trim blanks from the beginning and end of the strings to be compared.
I think my code does it two ways, inconsistently sometimes compressing mutiple blanks down to 1 blank between tokens. That might cause me to pick up extra matches if the profile has multiple blanks inside a "creditedAs" due to a typing error. I will check for that. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 14, 2007 | Reputation: | Posts: 4,695 |
| Posted: | | | | My program shows how they are actually credited in Profiler.
So the result means that there are 330 profiles where the credit is Ziyi Zhang and 36 profiles with Zhang Ziyi [Ziyi Zhang]. | | | My freeware tools for DVD Profiler users. Gunnar |
| Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting GSyren: Quote: My program shows how they are actually credited in Profiler.
So the result means that there are 330 profiles where the credit is Ziyi Zhang and 36 profiles with Zhang Ziyi [Ziyi Zhang]. Correct, but what does "credited" mean? The CLT says 364, and I believe that corresponds to is AiAustria's definition, that includes "CreditedAs". And remember, the XML prepared by CLTBoss in this case did not include a search for Zhang Ziyi, but of course you can find those variants in the profile. I actually thing the 330 + is a coincidence in this case. There are multiple cases: (1) - credited only in the creditedAs field (2) - credited in both creditedAs and F/M/L (3) - credited only in F/M/L (4) - neither I am counting a match on either (1), (2) or (3), as a single (need to check for a double count bug here) match for a specific credit entry in the profile. A profile is counted (once) when there are 1 or more credit matches. At least that's what I am attempting to code. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 14, 2007 | Reputation: | Posts: 4,695 |
| Posted: | | | | Quoting mediadogg: Quote: I am counting a match on either (1), (2) or (3) Sounds right to me. And I am presenting tha data based on F/M/L since that is the data that the Common Names are based on. Or at least that's how I have understood it. Personally I have no opinion on this, I am trusting that AiAustria will tell me if I'm doing it wrong. | | | My freeware tools for DVD Profiler users. Gunnar |
|
|
Invelos Forums->General: Website Discussion |
Page:
1... 8 9 10 11 12 ...26 Previous Next
|
|
|
|
|
|
|
|
|