WEBVTT

00:00.000 --> 00:12.480
Hello everyone, my name is Yelanda Pensa, I'm a senior researcher at SUP, which is a Swiss

00:12.480 --> 00:17.880
University at the Institute of Design, but I'm also an active member of Wikipedia being

00:17.880 --> 00:23.760
contributing since 2006, I've been chair of Wikimedia Italy, and now I'm also a national

00:23.760 --> 00:28.880
coordinator of Glams, which means that I spend a lot of time selling in cyclopideas around

00:28.880 --> 00:35.280
Switzerland as a volunteer. And this is a tool about Wikipedia, I don't think it's necessary

00:35.280 --> 00:43.280
to present Wikipedia. This year is the 25th anniversary, it is an open infrastructure, open

00:43.280 --> 00:50.520
content, it is also a website that serves the open science and the open Glams, and is also

00:50.520 --> 00:55.480
an open education on the resource, so I think it's a good symbol of the open in general.

00:55.480 --> 01:00.800
And the reason why we're developing this tool is because I spend a lot of time working on

01:00.800 --> 01:06.600
content, and I started by contributing to contemporary African art, and one of the problems

01:06.600 --> 01:13.160
with Wikipedia is that it's very difficult to grasp what a topic is about, how is the situation

01:13.160 --> 01:19.360
of a topic? You get an access to articles once at a time, and when you work on project,

01:19.360 --> 01:24.000
you get list of articles, but it's very difficult to get a sense of the landscape. So how

01:24.080 --> 01:30.480
a topic is documented on Wikipedia? What is the quality of that content? Is it reach or not?

01:30.480 --> 01:37.920
So I've been testing some visualization in some research project, I've been promoting.

01:37.920 --> 01:43.280
So one of the first visualization that we've been working on was to transform 100 articles

01:43.280 --> 01:49.520
related to the South African school system, and seeing how we can show how those articles

01:49.600 --> 01:57.680
are on Wikipedia, and how we change them by contributing and improving them.

01:57.680 --> 02:03.800
Another visualization tool that I've been developing with Giovanni Perfeita, was created

02:03.800 --> 02:11.160
in 2020 during the pandemic, in Italy, all schools were closed, so we were thinking, okay,

02:11.160 --> 02:17.360
Wikipedia is an open educational resource tool, but how is actually the quality of that content?

02:17.440 --> 02:25.920
We did research to define list of articles related to the school system. We looked at content related

02:25.920 --> 02:32.240
from primary school to high school on different subject, looking at high school books and different

02:32.960 --> 02:42.160
books. We organized topics related to disciplines, and we looked at how those articles were

02:42.240 --> 02:48.000
in different years. So we started in 2020, we involved the community to improve articles,

02:48.640 --> 02:56.960
and we checked how those articles changed throughout time. So what you see in this visualization

02:56.960 --> 03:01.840
is basically each article is a bubble. The dimension of the bubble tells you the dimension of the

03:01.840 --> 03:09.120
article. The dot inside is the dimension of the in-ship it, so the lead session, the first part

03:09.200 --> 03:14.640
of the article, which other times was of course a lot read, and also we did a focus group in which

03:14.640 --> 03:20.880
also teacher and students identified as very important, and also you have errors got each article in

03:20.880 --> 03:26.480
details and see how it changed through time, and you can organize a content according to

03:28.240 --> 03:35.920
different categories. So the idea of this visualization was really to identify how the landscape

03:36.000 --> 03:41.840
of certain article presented, and also to look at issue that could be fixed quite easily. So for

03:41.840 --> 03:49.200
example, if an article was already mentioning cleanup tags, or he had no images, so identify what

03:49.200 --> 03:56.880
could be the volunteers do quite quickly. So this project was also replicated in Uruguay,

03:56.880 --> 04:03.760
so they used the software to adapt it and to look at the articles related to the Uruguay

04:03.760 --> 04:12.320
and school system. So after this experience, we decided to create a tool based on this experience,

04:12.320 --> 04:20.560
and a tool that can be applied to any set of article, and we decided to focus with the research

04:20.560 --> 04:25.840
on climate change and sustainability, because of course, it's an important topic for everyone,

04:25.840 --> 04:32.240
and improving content on Wikipedia can have obviously a broader effect. So we started working on

04:32.240 --> 04:37.200
the visualization, and at the same time, Wikipedia education, which is a foundation in the United

04:37.200 --> 04:43.040
State that works a lot with classrooms and the school books, was at the same time working on

04:43.040 --> 04:50.320
this visualizing impact tool that was already working on the impact of contribution by students.

04:51.040 --> 04:57.200
So we decided to join the two projects and work together, and then take a little break.

05:03.200 --> 05:09.200
So the idea of the visualization is really to serve volunteer, to facilitate their work,

05:09.200 --> 05:13.280
to allow them to identify what is the priority, which article they should focus on.

05:14.320 --> 05:19.440
But also create a landscape that can be useful for researchers to identify topic,

05:19.440 --> 05:27.760
understanding how a topic is described, and how is it treated on Wikipedia, and also facilitating

05:27.840 --> 05:32.400
contribution by glams, analysis by journalists, and contribution by teachers.

05:33.520 --> 05:43.280
So at the moment, we have a pilot tool that you can find online, so it focuses only on climate change,

05:43.280 --> 05:49.760
but in reality, the tool can extract data from Wikipedia, or from categories on Wikipedia.

05:50.640 --> 05:54.720
And it already shows articles and the situation.

05:55.680 --> 06:02.400
The reason why the first visualization is, you see, all the bubbles squeezed on the top is the

06:02.400 --> 06:12.320
Donald Trump problem. So of course, the tool created a system to erase him, so that is actually

06:12.320 --> 06:23.120
possible to see the properly, the articles. It's obvious the first step, but one of the issues that

06:23.120 --> 06:28.080
are obviously we're facing now is that at the beginning, the idea was to focus on the number of

06:28.080 --> 06:33.920
visualization, but today things are changing, of course, with the machines that are visualizing,

06:33.920 --> 06:40.080
Wikipedia, more than humans. So we are changing some of the feature focusing more on contributors,

06:40.080 --> 06:45.120
and see which are the articles that have more contributors. We've been also

06:45.200 --> 06:51.760
involving, well, of course, you find the new or the data and the software online.

06:52.960 --> 06:59.440
And at the moment, we're still in the co-designing phase. So some of the visual, we are working on the

06:59.440 --> 07:07.440
first visualization, but we will be comparing languages, the project at the moment focuses on four languages.

07:07.440 --> 07:14.800
So Italian Spanish English and French, and we're also looking at how changes that been made

07:14.880 --> 07:21.680
from one through time. So the next visualization, which will be the Wikimedia Communities,

07:21.680 --> 07:28.480
so you can select articles according to their staff with the introducing the fact that at the

07:28.480 --> 07:35.040
moment I'm also working on trying to bring to first them in 2029, the entire open community.

07:35.040 --> 07:42.880
So Wikipedia, science, open educational resources, creative commerce, open street map communities,

07:42.960 --> 07:46.720
so try to have a joint event in 2029. Thanks.

07:47.120 --> 07:56.240
Thank you very much.

07:56.240 --> 08:03.240
Please.

08:03.240 --> 08:06.240
Please.

08:06.240 --> 08:09.240
The presentation on the type of this radiation.

08:09.240 --> 08:10.240
So, it's so hard.

08:10.240 --> 08:12.240
And you know what's going to be?

08:12.240 --> 08:15.240
The type of exposure to the screen of the distribution.

08:15.240 --> 08:17.240
And you describe the powers.

08:17.240 --> 08:22.240
Do you have plans for different laserization, or is this one,

08:22.240 --> 08:25.240
sitting your needs for now?

08:25.240 --> 08:26.240
No.

08:26.240 --> 08:29.240
One of the.

08:29.240 --> 08:30.240
Yes.

08:30.240 --> 08:36.240
So, at the moment, the question is about the different visualization that we're planning.

08:36.240 --> 08:38.240
No, there are different kind of visualization.

08:38.240 --> 08:42.240
This one is the first approach that we're working on.

08:42.240 --> 08:50.240
The emphasis will be also done on comparing how the initial set of article was presented

08:50.240 --> 08:54.240
and how changes produced a different visualization.

08:54.240 --> 08:56.240
This is something that is very interesting.

08:56.240 --> 09:01.240
Well, to satisfy volunteers, also to give feedback to the wiki project.

09:01.240 --> 09:05.240
Because a lot of a wiki project, a managed by one person or two people.

09:05.240 --> 09:10.240
So, the fact of having a feedback from my visualization is also important to encourage them.

09:10.240 --> 09:16.240
But it's also interesting for fundraising for affiliates because they can present a funders,

09:16.240 --> 09:21.240
what actually they did by organizing event by contributing to wiki media.

09:21.240 --> 09:27.240
So, the aspect of the before after is probably what it will transform.

09:27.240 --> 09:33.240
Also, another issue is how to create visualization that can be targeting different people.

09:33.240 --> 09:35.240
So, this is something that we are browsing.

09:35.240 --> 09:42.240
At the moment where we started with this catch, we have some more sketches in particularly focusing on comparing languages,

09:42.240 --> 09:47.240
which is also something we're focusing on for languages, to not make it huge.

09:47.240 --> 09:51.240
But we are designing the tools that you can be broadened.

09:51.240 --> 09:54.240
And also consider that this is a dashboard.

09:54.240 --> 09:58.240
So, wiki education already manages a dashboard,

09:58.240 --> 10:01.240
in particular to manage events in the community.

10:01.240 --> 10:05.240
So, the dashboard will be included within their work.

10:05.240 --> 10:08.240
So, people will be able to apply it to their needs,

10:08.240 --> 10:12.240
and you can serve really individuals and more projects.

10:13.240 --> 10:15.240
Yes, please.

10:15.240 --> 10:19.240
Are there any more visualizations that are more focused on the user,

10:19.240 --> 10:24.240
rather than the editor, such as the visualization,

10:24.240 --> 10:27.240
back to the media's part of everything else.

10:27.240 --> 10:33.240
The visualization focus more on the behind the scene of wiki media.

10:33.240 --> 10:37.240
So, we don't focus much on the users of the readers.

10:38.240 --> 10:43.240
In particular, now, with the change on AI, this is also an issue.

10:43.240 --> 10:47.240
One of the things that we are considering is also how to look at,

10:47.240 --> 10:51.240
how to identify in articles different styles, for example,

10:51.240 --> 10:56.240
that could be adapted and could be improved, also according to readers.

10:56.240 --> 11:01.240
But it's true that, for example, monitoring the impact of the article is something

11:01.240 --> 11:05.240
that a lot of researcher would love to know more.

11:05.240 --> 11:08.240
But there is always a, it's also a tricky issue,

11:08.240 --> 11:11.240
because for example, analyzing a discussion page,

11:11.240 --> 11:14.240
we show the size of the discussion page.

11:14.240 --> 11:17.240
If an article is, for example, a controversial,

11:17.240 --> 11:20.240
but we don't explore the discussion page and the dynamics,

11:20.240 --> 11:22.240
also for privacy issues.

11:22.240 --> 11:25.240
And so, it's something that we are not really putting on,

11:25.240 --> 11:26.240
on front of it.

11:26.240 --> 11:28.240
So, we're more focusing on the work behind the scene,

11:28.240 --> 11:31.240
to contribute and to improve content.

11:31.240 --> 11:34.240
Thank you.

