Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1... 3 4 5 6 7 ...18  Previous   Next
David Ogden Stiers
Author Message
DVD Profiler Unlimited Registrantantolod
Since Dec 02, 2003
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 940
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Forget_the_Rest: (bold added)
Quote:
Quoting antolod:
Quote:

I fully support a rule that 3 part names should be parsed 1/2/3 unless documented to be different, but so far Ken has not made that part of the rules.


I'm against that kind of rule & think that each case should be considered separately. Although it may be a common occurrence in the US, it's less common in others. ***


That is the reason why the name can be parsed correctly with documentation. Absent any "starting" point, using the shotgun method of parsing names will only continue the mess we have now. If you are adding a new actor to the db, and you know their name is 1//23, then by all means, add them that way, but include the information in the contribution notes so that 2 years later someone doesn't undo the correct parsing.

What this really boils down to is the amount of work necessary to determine which field each part of a 3-part name goes in. If there is a rule to "word count" unless you can document otherwise, Tim, you, me, or any user in the world should enter a 3-part name exactly the same way and if we bother to research and find it needs "special" handling, we would, each of us, be able to enter it correctly and document why it is entered in that way. Will there be disagreements? Probably. Will this completely eliminate current incorrectly parsed names? Probably not. Will it help? I think it would. Your mileage may vary.
Kevin
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,722
Posted:
PM this userDirect link to this postReply with quote
I agree. Anything to improve the mess we're now stuck with - and have been for years.

Oh - did we ever rule out the single name field? What a blessing that would be for all this...  And now I'm out.
DVD Profiler Unlimited RegistrantStar ContributorArdos
Registered: July 31, 2008
Reputation: High Rating
United Kingdom Posts: 2,506
Posted:
PM this userDirect link to this postReply with quote
I can see what you're saying but you're picking 1/2/3 due to culture. Although we may have plenty of people here who would be 1/2/3, we'll also have plenty who are 1//2 3. I don't see that providing info to "prove" either way shouldn't be an option. Yes it may involve some more work, but it's fairer overall. But by "forcing" people to use 1/2/3 if they can't find documentation to prove it should be 1/2 3 (or to support 1/2/3 for that matter) it'll be like the US naming standard is being imposed on all.
DVD Profiler Unlimited Registrantantolod
Since Dec 02, 2003
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 940
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Forget_the_Rest:
Quote:
I can see what you're saying but you're picking 1/2/3 due to culture. Although we may have plenty of people here who would be 1/2/3, we'll also have plenty who are 1//2 3. I don't see that providing info to "prove" either way shouldn't be an option. Yes it may involve some more work, but it's fairer overall. But by "forcing" people to use 1/2/3 if they can't find documentation to prove it should be 1/2 3 (or to support 1/2/3 for that matter) it'll be like the US naming standard is being imposed on all.


While it might be culture for me to pick 1/2/3 as a standard, you surely don't want each and every 3-part name to have to be documented in the contribution notes do you? I'd be willing to bet that the entire database has near 80% or more of the 3-part names parsed 1/2/3 correctly. Why on earth would we want to have to document those when the minority of 1 2//3 or 1//2 3 names can still be entered correctly with some documentation?

No rules are about fairness. The rules of baseball (or any other sport of your choice) are only to try to insure fairness as to the officiating of the game, and don't prevent one team from beating the other 100 to 0. What's fair about a team of 7 foot tall basket ball players against a team of 5 footers? The only fair thing to do would be a single name field, with as credited data only. Then no arguments about correct parsing, common names, breaking linking or anything else, it either matches the film credits or it doesn't. That would be fair, but severely cripple the functionality of the program as it stands today.

A "1/2/3" parsing rule would be an attempt to keep everyone on the same page and establish a standard, so we don't have 2 or 3 variants of a 3-part name. The biggest problem is when 1 user correctly documents and enters a 1//2 3 name but other users profiling the same film but different UPC/EAN don't. Still 2 forms for the same person. I'm not sure that is something that will ever be completely eliminated, no matter what any rule might say, since each profile stands on its own. The only way to eliminate multi-variants for any name is for there to be a central db of names and I'm not sure how that could be implemented.

The current system, that allows basically any parsing a person wants to contribute works how ever well or bad we believe, and we probably don't need a parsing rule even though I would support one. Most of the time, most names are probably entered the correct way, but there are a lot of profiles that haven't been looked at for a very long time. My limited experience in looking at name variants has shown me that there are a huge number of profiles with IMDB scraped data that aren't getting many updates. Those "inactive" profiles are the ones that keep the CLT from giving good results, and though parsing is not something the CLT cares about, if a user starts to enter a person's name and their local has the incorrect version, that's the one that will be used most of the time, whether it's the parsing or the IMDB name.
Kevin
Invelos Software, Inc. RepresentativeKen Cole
Invelos Software
Registered: March 10, 2007
United States Posts: 4,282
Posted:
PM this userEmail this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?
Invelos Software, Inc. Representative
DVD Profiler Unlimited RegistrantStar Contributortweeter
I aim to misbehave
Registered: June 12, 2007
Reputation: High Rating
United States Posts: 2,665
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
I'm trying to figure out if two fields would make things better or worse for those with three words in their names.

People have suggested another solution is to rename the three fields from:

First Name
Middle Name
Last Name

to something like:

Name Field #1
Name Field #2
Name Field #3

to hopefully remove some cultural bias.  Maybe that's a better direction to go.

P.S. A small program improvement that may or may not help but is related would be for the program to parse a little smarter when new names are entered.  Example: I enter John C. McGinley for the first time.  When i click Add Cast Member it currently parses it:

First Name: John
Last Name: C. McGinley

instead of looking at the spaces and splitting it among the three fields.

Whatever is decided the programs default parsing should reflect the decision.
Bad movie?  You're soaking in it!
 Last edited: by tweeter
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
To be perfectly honest, Ken, I have never understood the real issue on this. the attitude of some users would strike me better were they to say I am Helena Bonham Carter and this is how it should be, they come across to me like they have some sort of strange vested interest or are personally insulted at the thought of Helena/Bonham/Carter. I see it as data On Screen and we are not dealing with family histories, as the long as the data appears in the correct order, I frankly don't see the parsing argument and never have. There is a way that most Americans handle names and that is the way each individual chooses to handle his name, not based on some imposed cultural standards, and who is it exactly that is going to tell me how to parse my name or that of helena Bonham Carter or anyone else.

I fully support
Name Field #1
Name Field #2
Name Field #3

and if you want to add a field for Prefix and Suffix to handle those more appropriately that would be fine as well, though Reverend John Smith, Jr. causes absolutely no grief for me, there seem to be some for whom it does.<shrugs>

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited Registrantantolod
Since Dec 02, 2003
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 940
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Ken, I'd be more inclined to renaming the existing 3 fields and go to simple word counting, if the goal is to eliminate multiple entries that are based on parsing. Having only 2 name fields would result in the same situation we have now, where user A would enter John/Q. Public and user B would enter John Q./Pubic. All of the arguments about double-barrel last names would still exist.

In fact, even with Field 1, Field 2, Field 3, and simple counting, we will have the potential for problems, because of someone like HBC sometimes being credited with a hyphenated last name HB-C. The linking for her would not work unless a rule was set for hyphenated names to use a non-hyphenated word count common name and Credited as for the hyphenated version.

IMO, what we have now seems to work MOST of the time. As has been pointed out, there still exist today, parsing variations for even some of the double-barrel last named people who are more famous like HBC. I think a standard method spelled out in the rules, with variation allowed with some form of proof that is accumulated in a dedicated thread much like the BY thread, would go a long way to solving this issue.

I also agree it would be nice for prefix and suffix fields if you were so inclined to add them.
Kevin
DVD Profiler Desktop and Mobile RegistrantStar ContributorTheMadMartian
Alien with an attitude
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 13,201
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Ken Cole:
Quote:
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?

A two name field wouldn't solve any of these issues as the problem of which name(s) belong in the last name field will still be there.

Using this name as an example, would it be 'David Ogden/Stiers' or 'David/Ogden Stiers'?

What would solve this issue is to create a base standard, in the rules, with deviation allowed with documentation.  JMHO
No dictator, no invader can hold an imprisoned population by force of arms forever.
There is no greater power in the universe than the need for freedom.
Against this power, governments and tyrants and armies cannot stand.
The Centauri learned this lesson once.
We will teach it to them again.
Though it take a thousand years, we will be free.
- Citizen G'Kar
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Unicus69:
Quote:
Quoting Ken Cole:
Quote:
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?

A two name field wouldn't solve any of these issues as the problem of which name(s) belong in the last name field will still be there.

Using this name as an example, would it be 'David Ogden/Stiers' or 'David/Ogden Stiers'?

What would solve this issue is to create a base standard, in the rules, with deviation allowed with documentation.  JMHO

Agreed, but by also using the Name Field Concept. I have no problem with deviation as long as there is documentation to support it, ANY kind of deviation, not just someone's assumption or guess. Assumptions and guesses will one day cause big trouble and then we will have to unwind and disentangle them, which will prove very problematic.

Skip

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited RegistrantStar Contributorsurfeur51
Since July 3, 2003
Registered: March 29, 2007
Reputation: Great Rating
France Posts: 4,479
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Ken Cole:
Quote:
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?


Though I'm at this moment a little off those forums, I think this is too important a problem not to give my opinion.
Two-fields names have no more and no less advantages than three fields : you always has to choose where you put Scott for Kristin Scott Thomas, and no rule, no automatic system will ever solve that problem for all type of names (american, european, asian cultures). What is important is to allow to decide which field is the sort field (with a check box, for instance), to resolve stage names and asian names problems.

I think that a solution as
Name Field #1
Name Field #2
Name Field #3
would be the worse. We would get things like Jean/Paul/Belmondo or Cedric/The/Entertainer or The /Dandy/Warhols, which would be even worse than present situation.
Images from movies
 Last edited: by surfeur51
DVD Profiler Unlimited RegistrantStar ContributorAddicted2DVD
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 17,330
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Unicus69:
Quote:
Quoting Ken Cole:
Quote:
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?

A two name field wouldn't solve any of these issues as the problem of which name(s) belong in the last name field will still be there.

Using this name as an example, would it be 'David Ogden/Stiers' or 'David/Ogden Stiers'?

What would solve this issue is to create a base standard, in the rules, with deviation allowed with documentation.  JMHO


I agree... I don't see how going to a 2 name field would help anything at all.  You still have to know where to put the middle word(s) of a 3 (or more) part name. I still think the best bet would be a single name field... that would take care of the parsing problems. But since that is out there needs to be some sort of standard on how to parse the names.
Pete
DVD Profiler Desktop and Mobile RegistrantStar Contributorm.cellophane
tonight's the night...
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 3,480
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Unicus69:
Quote:
Quoting Ken Cole:
Quote:
We don't intend to move to a single name field since it would reduce functionality as others have mentioned.  However, I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?

A two name field wouldn't solve any of these issues as the problem of which name(s) belong in the last name field will still be there.

Using this name as an example, would it be 'David Ogden/Stiers' or 'David/Ogden Stiers'?

What would solve this issue is to create a base standard, in the rules, with deviation allowed with documentation.  JMHO

I agree with Unicus.

I don't like the suggestion of renaming the fields as Name 1, Name 2, Name 3. In that example, a two word name would occupy Names 1 and 2. Basically we want all of our surnames in the same field so that some people can sort on them.

Stating in the rules, for example, that "Last Name" = "Surname and Suffixes" would be helpful. But basically Unicus' last sentence is what we need, and it wouldn't require messing around with the program interface.
...James

"People fake a lot of human interactions, but I feel like I fake them all, and I fake them very well. That’s my burden, I guess." ~ Dexter Morgan
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,722
Posted:
PM this userDirect link to this postReply with quote
Quoting m.cellophane:
Quote:
Stating in the rules, for example, that "Last Name" = "Surname and Suffixes" would be helpful.

The problem is that such words cause problems. People in different places in the world might have completely different understandings of what something like "surname" means. It doesn't necessarily mean the same thing to you as it does to me, and so we'll be stuck with the exact same problems. Those that automatically, instinctively recognize, for instance, "Ogden Stiers" as the guy's surname, will still be tempted to put it into the appropriate field, if that's what you tell them to do.

I'm also not a big fan of "documentation". Sure, it all sounds very reasonable, until you realise it hardly ever exists. The fact that we haven't managed to come up with so much as five examples where a double last name should be listed in the "last name" field, doesn't exactly make me think this is something that we desperately need to keep at all costs (all costs being: perpetual parsing disputes for all eternity). If the number of examples we might actually reach an agreement on is really that low, I'd rather give it up altogether and just have a no-exceptions-at-all standard that gets everyone on the same page immediately.

The key here is not whose preference to choose, but to get everyone on the same page. Using field names that will be interpreted in different ways throughout the world means the problem will remain. As with common names, it's important to realise that it really doesn't matter whether we use variant X or Y - the important thing is that we all use the same one. That's why Ken moved away from using documentable "real" or "correct" names: documenting it just doesn't work, which is why he chose for a starting point that can (relatively) easily be established right here at Invelos.com by everyone. Given that there's even less available documentation on "correct" parsing then there is on "correct" names, it's decidedly not a good idea to re-introduce external documentation here. Sure it sounds nice, but it just doesn't work. It really doesn't matter so much how we do this, as long as we can be sure that EVERYONE, throughout the various regions and localities, automatically does it the same way.
 Last edited: by T!M
DVD Profiler Unlimited RegistrantStar ContributorDaddy DVD
Lost in Translation
Registered: March 14, 2007
Netherlands Posts: 2,366
Posted:
PM this userDirect link to this postReply with quote
I don't think it would be necessary to rename the current first and third field, but I would very happy if the second one would be eliminated because it's often a real pain for us foreigners to find out what should belong there. To find out if a name should be part of the given names or surnames is a lot easier.
Martin Zuidervliet

DVD Profiler Nederlands
 Last edited: by Daddy DVD
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,722
Posted:
PM this userDirect link to this postReply with quote
Quoting Ken Cole:
Quote:
I don't see a problem with moving to a two-field name, which would reduce (although not eliminate) parsing disputes.  Thoughts?

I'd certainly like it better over what we have now. At first, I agreed that it would't eliminate the problems entirely, but like Martin, I felt that that most users - including the vast majority that hardly ever comes the forums and doesn't provide input here - would be able to work with that a lot better.

After thinking about it some more, though, I realised you could quite easily accompany this with a line in the rules that clarifies that the first word of any name goes into the "first name" field, and all the rest goes into the second field. And hey presto: then we're done. Everyone on the same page immediately, and no more questions about this ever
 Last edited: by T!M
    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1... 3 4 5 6 7 ...18  Previous   Next