Conversation

@TheoDlmz
Contributor

No description provided.

@TheoDlmz
Contributor Author

PS: for some reason the Pol.is dataset has a publication date in the future and appears as the most recent one on Preflib.

@Simon-Rey
Member

Would it not be better to merge these with the previous datasets? At least 73 and 71 seem mergeable to me. We did it like this for AAMAS bidding, for instance: different files per year (see https://preflib.github.io/PrefLib-Jekyll/dataset/00037). What do you think?

@TheoDlmz
Contributor Author

Yes, that could make sense. I made separate files because the set of authors on each one is not exactly the same, and they each require their own citation... but I could put all citations in "required citations"? Or try to find a paper on which everyone is an author? But true, it might be better to find a way to merge them, because more are coming (see https://theo.delemazure.fr/datasets/).

Also, I had a question: is it possible to have fractional "counts" in the file, like "2.31: 1, 2, {3,4}, 5"? (Asking because in many of these datasets we have weights for votes, to reduce biases.)
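
For illustration, here is a minimal sketch of how a line with a fractional count could be read, assuming the line keeps the "count: order" shape quoted above with braces marking ties. The function name `parse_weighted_order` is hypothetical and not part of any PrefLib tool; it only shows that nothing in the line syntax itself prevents a non-integer count.

```python
import re

def parse_weighted_order(line):
    """Split 'weight: 1, 2, {3,4}, 5' into (weight, list of tie groups).

    Hypothetical helper for illustration only; not a PrefLib API.
    """
    weight_text, order_text = line.split(":", 1)
    weight = float(weight_text)  # accepts "2.31" as well as "2"
    # A token is either a braced tie group or a bare alternative number.
    tokens = re.findall(r"\{[^}]*\}|\d+", order_text)
    order = []
    for token in tokens:
        if token.startswith("{"):
            members = [int(x) for x in token[1:-1].split(",") if x.strip()]
            order.append(tuple(members))
        else:
            order.append((int(token),))
    return weight, order

weight, order = parse_weighted_order("2.31: 1, 2, {3,4}, 5")
print(weight, order)
```

Each position in the order is stored as a tuple, so a strict position and a tie are handled the same way downstream.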

@Simon-Rey
Member

I think you can put all the papers in the required citations (which no one really uses, unfortunately...).

About weights: it's not yet part of the format specification but, strictly speaking, why not. I'm just not sure that they should be included in the data for this specific case: the weights are values that you chose yourselves to balance different factors that you found important, right? If so, then it's a bit of an arbitrary choice which weight goes where, so I would not put them in. Does that make sense?

@TheoDlmz
Contributor Author

OK, I'll do this then! Yes, these are values chosen to match the distribution of votes in the actual election (otherwise, in some elections, Mélenchon wins with 60% of the votes). For some use cases (like running time) it is not very important, but if one wants to interpret the results it might be. I can also just mention that the file with weights is available elsewhere for people who are interested.
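
As a rough illustration of what "matching the distribution of votes" can mean, the sketch below reweights each participant by the ratio between an official vote share and the share observed in the sample. This is only one common way to do it and is an assumption here: the thread does not describe how the actual weights were computed, and the function name, candidate labels and numbers are all made up for the example.

```python
from collections import Counter

def reweight_to_official_shares(declared_votes, official_shares):
    """Give each participant the weight official_share / sample_share
    of the candidate they declared voting for.

    Illustrative assumption, not the procedure used for these datasets.
    """
    counts = Counter(declared_votes)
    n = len(declared_votes)
    return [official_shares[c] / (counts[c] / n) for c in declared_votes]

# Toy sample: candidate "A" is over-represented (75% instead of 50%).
declared = ["A", "A", "A", "B"]
official = {"A": 0.5, "B": 0.5}
weights = reweight_to_official_shares(declared, official)
print(weights)
```

With this choice the weights sum to the number of participants, so the weighted sample keeps its original size while the over-represented candidate's supporters count for less.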

@TheoDlmz
Contributor Author

I have not merged the French datasets yet, but I added another dataset (SCOTUS data). I will take care of the French datasets next.

@Simon-Rey
Member

I wonder if the sanity check is failing because of the line breaks in the Selected Studies. Maybe try to put them all on a single line?

@Simon-Rey
Member

It should be the missing spaces after the comma in the header of the file description section... (yeah sorry, weird choice I've made there)
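
A quick way to spot this kind of problem locally is sketched below, assuming the header is a single line of comma-separated field names that must be separated by a comma followed by a space. The function `find_missing_comma_spaces` and the example header are hypothetical; the real sanity check and the actual header fields are not shown in this thread.

```python
import re

def find_missing_comma_spaces(header):
    """Return the index of every comma that is directly followed by a
    non-whitespace character.

    Hypothetical helper; not the repository's sanity check.
    """
    return [m.start() for m in re.finditer(r",(?=\S)", header)]

# Made-up header: the first comma lacks the trailing space.
header = "file_name,title, description"
print(find_missing_comma_spaces(header))
```

An empty list means every comma in the header is followed by whitespace (or ends the line).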

@Simon-Rey
Member

Should I then wait for you to merge the French datasets before merging?

@TheoDlmz
Contributor Author

TheoDlmz commented Feb 4, 2026

Yes, I will (hopefully) do that before the end of the week.

@TheoDlmz
Contributor Author

TheoDlmz commented Feb 4, 2026

Thanks for finding the bug!

@TheoDlmz
Contributor Author

TheoDlmz commented Feb 4, 2026

I think I will split the datasets like this, if that is okay with you:

  • "Voter Autrement" In Situ experiments in French Presidential elections (merge of 2007, 2012, and 2017 data) => Approval and Evaluations
  • Ranking-based experiments in French Presidential elections (merge of two experiments: 2007 and 2017) => Rankings
  • "Voter Autrement" Online experiments in French Presidential elections (merge of 2017 and 2022) => All kinds
  • "Vote Pluriel" Online experiment in 2012 French Presidential election (approval/IRV)

@Simon-Rey
Member

That looks good to me. It's a good idea to organise per experiment. It's probably what makes the most sense 😊

@nmattei
Member

nmattei commented Feb 4, 2026 via email

@TheoDlmz
Contributor Author

TheoDlmz commented Feb 6, 2026

Alleluia :D

@Simon-Rey
Member

Nice, so you're all done? :)

@TheoDlmz
Contributor Author

TheoDlmz commented Feb 8, 2026

Yep!

@Simon-Rey Simon-Rey merged commit 18d4b75 into PrefLib:main Feb 9, 2026
1 check passed

3 participants