
Artificial intelligence and gymnastics scoring - your opinions please

The Guardian yesterday had an interesting article about the FIG's plans to introduce computer judging to gymnastics.  I suggest you give it a read.

It won't surprise you to know that I have lots of things to say about this; I think that AI can contribute something to scoring, but not to judgement.  If the implementation of this initiative is not managed with common sense and imagination, we could find ourselves with a sport that is even more devoid of artistry and the aesthetic.  

In response to a comment asking for evidence of biased judging, especially in favour of the USA, I commented as follows.  I would be interested in hearing your opinions.  

'There is plenty [evidence of bias]. But the judging can be unreliable in all sorts of different directions, not just for the USA.

The problem arises at Code level, when the grading of moves and the bonus points are determined. Every country has a say in this, but naturally those countries with the strongest political representation in the sport will have the strongest influence. It is something that builds up over the years. So for example in WAG we currently have a Code that on floor and vault emphasises powerful acrobatics to the extent that the aesthetic has gone AWOL on floor in all but a few exceptional cases.

Vaulting requirements have changed and are now so physically demanding that very few gymnasts in the world can prepare two competitive vaults at international level - e.g. in Europe in 2013, out of a field of several hundred gymnasts, only 13 attempted to qualify for the eight-place vault final.

At world level the USA leads in vaulting and acrobatics and there is a significant gap between the leading Americans and the rest of the world. When combined with what could be considered biased judging - a natural human tendency to overlook errors in those considered to have an almost mystical command of the sport - this adds up to an advantage [which might be considered a bias]. For example, Simone Biles' highly acrobatic work is astonishing in its accuracy and has extremely high difficulty scores, yet the judges seem to ignore failures in the aesthetic quality of her work. Gradually her scores on bars and beam have crept up as belief in her strengths elsewhere makes it uncomfortable for the judges to deduct. The 'wow' factor blinds judges and fans to the less than perfect state of artistic presentation in some of Biles' work, while others with more grace and less athleticism struggle to find the same certainty and confidence in the judges' evaluations of their work. These are minor, usually tiny granules of distinction that build up in favour of one gymnast and one style over another. They in turn affect the shape of the sport as it progresses and the Code develops, and in performance affect the psychology of the gymnast and the reliability of competition.

A computer system that relies on measurement and quantification of movements will only emphasise these distinctions and detract from the aesthetic side, unless its implementation is carefully managed to allow for the judges' panel to pay more attention to the impression of the whole routine. There would have to be a splitting of the scores to introduce a technical mark (computer) and an artistic score (judges). As far as I can read, the FIG hasn't yet considered this, so unless the target of 2020 is purely a pilot run, they are getting ahead of themselves on every apparatus except vault. If the pilot is on vault only, as this article seems to suggest, then that could be a good thing as vaulting is a single skill and the measurement and judging process already seems to be highly technical and well elaborated.

The role of President of the FIG is at face value a mouthpiece job, yet for decades this mouthpiece has influenced the direction of the sport disproportionately and favour has been cast on his (no female President ever!) national programme. Titov, during the Soviet-dominated era, saw the language of gymnastics favouring aesthetic, innovative gymnastics; Grandi presided over a period of growth for Italian WAG gymnastics [with the introduction of the additive score leading to the only Italian AA World Champion, with a fall]; and now Watanabe introduces a technological step forward that is of potential benefit to the Japanese economy, while the JPN gymnastics programme continues to lead MAG and to grow WAG.

Introducing computer judging will only emphasise the growing tendency in both MAG and WAG to favour content over quality unless the FIG considers the whole picture and implements gradually with review of the gymnastics routines favoured [in addition to the calculation of D scores and deductions for faulty execution] and their likely influence on the direction of the sport.'

Comments

  1. We have been using AI in training for a number of years now (wrist/ankle/toe/hip sensors) and similar technology is used in other sports (Hawk-Eye in tennis is an obvious example, but fencing, boxing and other sports use their own forms of this technology). I have no problem with AI judging provided the technology used is made available at all levels so all gymnasts and judges can learn with it and it promotes a level playing field and balanced judging. In fact, compulsories were largely designed to provide just that. They were just a small sample of all possible routines which showed how well a gymnast had learned those particular technical requirements.

    To me, gymnastics has far bigger problems than judging. WADA (and thus, political influence), 40-60% injury lists and nation-swapping are far more damaging to the identity of gymnastics than judging (biased or not). We would do well to remember the discussions of the '90s, when dropping gymnastics from the Olympics was seriously being considered, and why that was.

    1. That's interesting Dave. Do you know if the new system will be compatible with your metrics?

    2. We built the system at our local gym ourselves from existing software used in cricket to help with bowling actions. It's used by the gymnasts themselves to judge their body forms during skills, splits, leaps, twists etc., and we can go through the video with them if needed. This frees up our time as trainers. It could feasibly be used for judging, but we do not run competitions there.

      FIG and the IOC are not exactly keen to have judging transparency and I doubt they will make this system open for criticism or cross-compatible with any existing system. I believe it will be completely hidden with only the scores announced.

      One thing bothers me. It will be a marketing goldmine if gyms around the world are forced to buy into this system, and Fujitsu will be keen to keep the technology and software patents.

