January 14, 2022 at 12:47 am #7669KenNParticipant
Recently I conducted an experiment in which 61 participants participated, and the Results panel shows “Preview of latest submission (out of 61)” indicating there were 61 participants, but when I downloaded the csv file, there were only 60 participants, with 1 missing.
I checked further into the results, and I found that the data of the 2nd participant is missing. The results.csv indicates that the presence of the participant in question because the following rows have been recorded:
# # Results on Thu # USER AGENT: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML # Design number was non-random = 0 # # Columns below this comment are as follows: # 1. Results reception time. # 2. MD5 hash of participant's IP address. # 3. Controller name. # 4. Order number of item. # 5. Inner element number. # 6. Label. # 7. Latin Square Group. #
However, no further rows are recorded with this participant.
Now I remember that I had similar cases in the past. In all these cases, the participants whose data are missing seem to have completed the tasks to the end, because they provided us with the randomly generated codes that are shown at the end of the experiment.
My question is, how and under what conditions could this happen? Are there any precautions I can take to prevent this?
Thanks!January 14, 2022 at 12:43 pm #7670
Unfortunately that submission was sent to the farm’s servers precisely at the beginning of a short downtime episode, at 2:12am GMT on January 13. The submission entry was created, but the service crashed right before the data could be saved to memory
Unfortunately there is not much you can do on your end. I have been monitoring the problem more closely these past couple days, and the crashes appear to happen when there is a lot of incoming data at once, so minimizing the data in the submissions is good practice, but in your case, we’re talking about only 30 lines per submission, which is already very little
My attempts at controlling or working around those crashes have not been the most successful so far. The service is supposed to temporarily save a copy of the incoming data on disk before sending it to the database, but even that copy seems to have failed to be saved in your case
I apologize for the inconvenience. I am still actively working on addressing the issue
JeremyJanuary 15, 2022 at 9:59 pm #7671KenNParticipant
OK, thank you for the detailed explanation regarding the situation. I really appreciate the presence of this platform and also thank you for your efforts to improve the system!
-Ken N.January 18, 2022 at 12:49 pm #7674mereloParticipant
I seem to have the same problem, but out of 10 participants, 6 were not saved. The details are the same; I have a few rows per “missing” participant recorded in my results file, like so:
# Results on Tue
# USER AGENT: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML
# Design number was non-random = 8
# Columns below this comment are as follows:
# 1. Results reception time.
# 2. MD5 hash of participant’s IP address.
# 3. Controller name.
# 4. Order number of item.
# 5. Inner element number.
# 6. Label.
# 7. Latin Square Group.
All 6 participants submitted between 15:57 GMT and 16:17:47 GMT; other participants’ data was successfully stored before and after this point. Can this be attributed to a downtime episode as well?
Would minimising the data in submissions help prevent this, and if so, how would I do that? (I still need to collect 160 participants.)
MerelJanuary 18, 2022 at 7:31 pm #7681
The database shows that your experiment did receive 10 submissions, with between 1918 and 1922 rows for each of them, so you should see all your submissions and the corresponding rows in the results file. Maybe you tried to generate the results file before all the incoming data had finished being processed by the server?
Let me know if the problem persists
JeremyJanuary 19, 2022 at 3:35 am #7682mereloParticipant
You’re right, thank you! I tried downloading it several times yesterday and it kept coming out the same way, but reloading it this morning made more rows appear.May 13, 2022 at 3:54 pm #8179monicadoParticipant
I’m having a similar issue with missing results in my data. It appears I’m missing 4 submissions from the following experiment: https://farm.pcibex.net/r/SpzIlM/. I wonder if there is any chance this data can be recovered from the PCIbex server?
MonicaMay 13, 2022 at 4:27 pm #8180
I see 13 submissions for that experiment, the four most recent ones received on April 27, April 28, May 6 and May 12. Each of your 13 submissions has between 54 and 58 rows
Were you expecting another 4 submissions (for a total of 17 submissions)?
JeremyAugust 9, 2023 at 9:16 am #10806ajessenParticipant
now I have that exact same problem with my experiment: 1 out of 4 participants (who clearly did the experiment (recruited via prolific, and s/he got the completion code right)) shows me only the rows as posted above:
# Results on Wed 09 Aug 2023 11:30:00 GMT # USER AGENT: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML like Gecko) Chrome/126.96.36.199 Safari/537.36 # Design number was non-random = 0 # # Columns below this comment are as follows: # 1. Results reception time. # 2. MD5 hash of participant's IP address. # 3. Controller name. # 4. Order number of item. # 5. Inner element number. # 6. Label. # 7. Latin Square Group.
link to the experiment:
It has been several hours now – I refreshed the page, logged out and in again of PCIbex Farm, but there are never more rows. Normally when I do these little experiments, I can download all of the results when prolific is telling me that the participants completed the experiment.
Is it possible that it sometimes takes longer for the results to show up? Or is it again a problem with the server?
Best, AnnaAugust 15, 2023 at 2:55 am #10815
Sorry for the slow reply. I only have intermittent access to the database, and I haven’t had the chance to look it up yet. I’ll take a look at it, but judging from the comment lines you see in the results file, if the submission is still not present when you download the file, chances are that an entry was added to the database but the content never reached it for some reason, which can happen when the load on the server is high. Apologies for the inconvenience
JeremyAugust 16, 2023 at 4:17 am #10816
I was finally able to check our servers, and unfortunately I can now confirm that the rows for that submission are nowhere to be found. Apologies for the inconvenience
JeremyAugust 16, 2023 at 4:18 am #10817valentina_pParticipant
The results file obtained from an experiment seems to be affected by partial data loss. I ran the survey yesterday, August 15, and advertised it through the Clickworker platform. There it said, around 1 pm (Berlin time), that the survey was completed by 32 people (my desired N). So, I checked my results file on PC Ibex, but there are only 19 people listed and the last update of the file was around 11 am. So, I am missing 13 participants (entirely, no rows are present in the data file after the 19th participant), as the results do not cover the last two hours of the experiment. Is there anything that can be done to retrieve the missing data? Why could that have happened?
We had already used the same script for other experiments, and this problem had never occurred.
Following the suggestions from the support forum, I have already refreshed the PC Ibex page, logged out and then back in, and tried from different browsers and devices, but nothing helps.
This is the link to the experiment if it helps:
ValentinaAugust 16, 2023 at 4:35 am #10818
I replied to an email sent to email@example.com about this. Like I explained in my email, that address does not point to the PCIbex Farm, so we cannot help with data recovery. However, keep in mind that some participants might not have really completed the experiment but instead retrieved the confirmation code by other means to get credits
JeremyAugust 16, 2023 at 7:40 am #10821valentina_pParticipant
Thank you for the quick and helpful reply, Jeremy!
- You must be logged in to reply to this topic.