• @[email protected]
    link
    fedilink
    English
    311 days ago

    Sounds like those statistics output would the heavily biased by whatever process you were using to turn names into genders. In short, a bad idea.

    • @[email protected]
      link
      fedilink
      410 days ago

      “Since the dataset isn’t 100% perfectly annotated for analysis, we should give up the whole project entirely.”

      • @[email protected]
        link
        fedilink
        2
        edit-2
        10 days ago

        No, since the dataset is bound to give nonsensical results, we search for sources that are more precise. Hint: “Andrea” already mentioned and Japanese names