The Superdiversity Index

Name: The Superdiversity Index
Published: 2024-04-30

Pollacci, Laura; Sirbu, Alina, 2024, "The Superdiversity Index", https://doi.org/10.17903/FK2/AVI6AH, SoDaNet Data Catalogue, version 1

Abstract

The Superdiversity dataset contains the Superdiversity Index (SI), calculated from the diversity of the emotional content expressed in texts produced by different communities. The emotional valences of the words used by a community are extracted from Twitter data produced by that community, so the SI is built on Twitter data and lexicon-based sentiment analysis. The dataset also includes other possible diversity measures calculated from the same data, such as the number of tweets in the community language, the Type-Token Ratio, and the number of languages in a community. The SI ranges in [0, 1]:

A value of 0 means that the computed valences are very close to those of a standard emotional lexicon.
A value of 0.5 indicates no correlation between the emotional content of words used by the community on Twitter and the standard emotional content.
A value of 1 would correspond to the use of terms with the opposite emotional content compared to the standard.

Data are computed at three geographical scales based on the Classification of Territorial Units for Statistics (NUTS), i.e., NUTS1, NUTS2, and NUTS3, for two countries, Italy and the United Kingdom. The untagged Twitter dataset is composed of just under 73,175,500 geolocalised tweets gathered over three months, from 1 August to 31 October 2015.
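
The SI scale described above (0 for close agreement, 0.5 for no correlation, 1 for opposite emotional content) is consistent with a linear rescaling of a correlation coefficient. The sketch below is an assumption made for illustration only, not the authors' published procedure: it maps a Pearson correlation r between community valences and standard-lexicon valences to (1 - r) / 2, and computes the Type-Token Ratio as distinct tokens over total tokens. All function names and sample values are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def superdiversity_index(community_valences, standard_valences):
    """ASSUMED mapping: rescale r in [-1, 1] to an index in [0, 1].

    r = 1 gives 0 (same emotional content), r = 0 gives 0.5,
    r = -1 gives 1 (opposite emotional content).
    """
    r = pearson(community_valences, standard_valences)
    return (1.0 - r) / 2.0


def type_token_ratio(tokens):
    """Number of distinct tokens divided by the total number of tokens."""
    if not tokens:
        raise ValueError("token list is empty")
    return len(set(tokens)) / len(tokens)


if __name__ == "__main__":
    # Hypothetical valences for the same four words in both lexicons.
    standard = [1.0, 2.0, 3.0, 4.0]
    community = [1.2, 1.9, 3.3, 3.8]
    print(superdiversity_index(community, standard))
    print(type_token_ratio(["good", "day", "good", "night"]))
```

Under this assumed mapping, the index depends only on how the community valences co-vary with the standard lexicon, not on their absolute scale, which matches the three reference points given in the abstract.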

Subject

Social Sciences

Keyword

MIGRANTS, CULTURAL INDICATORS, SUPERDIVERSITY

Related Publication

Pollacci, Laura, Alina Sirbu, Fosca Giannotti, and Dino Pedreschi. "Measuring the salad bowl: Superdiversity on twitter." arXiv preprint arXiv:2204.10646 (2022).

Citation Metadata

Data Project Persistent ID

doi:10.17903/FK2/AVI6AH

Publication Date

2024-04-30

Data Project Category

Indices & Classifications

Title

The Superdiversity Index

Principal Investigator

Pollacci, Laura
(University of Pisa)
ORCID: https://orcid.org/0000-0001-9914-1943

Sirbu, Alina
(University of Pisa)
ORCID: https://orcid.org/0000-0002-3947-7143

Publisher

SoDaNet - EKKE

Contact

Pollacci, Laura
(University of Pisa)

Kondyli, Dimitra
(National Centre for Social Research)

Klironomos, Nicolas
(National Centre for Social Research)

Subject

Social Sciences

Keyword

MIGRANTS
(ELSST)
https://elsst.cessda.eu

CULTURAL INDICATORS

SUPERDIVERSITY

Topic Classification

Language and linguistics; Cultural and national identity

Related Publication

Pollacci, Laura, Alina Sirbu, Fosca Giannotti, and Dino Pedreschi. "Measuring the salad bowl: Superdiversity on twitter." arXiv preprint arXiv:2204.10646 (2022).
arXiv: 2204.10646
https://arxiv.org/pdf/2204.10646.pdf

Language

English

Contributor

Data processing
Pollacci, Laura
(University of Pisa)
https://pages.di.unipi.it/pollacci/index.html

Project/Study design
Sirbu, Alina
(University of Pisa)
https://kdd.isti.cnr.it/people/sîrbu-alina

Project/Study design
Giannotti, Fosca
(Scuola Normale Superiore, Pisa)
https://kdd.isti.cnr.it/people/giannotti-fosca

Project/Study design
Pedreschi, Dino
(University of Pisa)
https://kdd.isti.cnr.it/people/pedreschi-dino

Grant Information

Horizon 2020
GA 870661

Horizon 2020
GA 654024

Horizon 2020
GA 871042

Distributor

Social Data Network
(SoDaNet)
https://sodanet.gr

Distribution Date

2022-03-30

Depositor

Klironomos, Nicolas

Deposit Date

2024-03-21

Time Period Covered

Start: 2015-08-01
End: 2015-10-31

Kind of Data

Textual data

Related Material

The dataset is a subset of the dataset in Coletto, M., Esuli, A., Lucchese, C., Muntean, C.I., Nardini, F.M., Perego, R., Renso, C.: Perception of social phenomena through the multidimensional analysis of online social networks. Online Social Networks and Media 1, 14–32 (2017) doi: 10.1016/j.osnem.2017.03.001.

Data Sources

Research data; Other

Dataset Version

version 1

Geospatial Metadata

Geographic Coverage

Italy

United Kingdom

Social Science and Humanities Metadata

Time Method

Cross-section

Collection Mode

Content coding

Type of Research Instrument

Programming script

Unit of Analysis

Media unit: Text


Superdiversity dataset

Description: The Superdiversity dataset contains the Superdiversity Index (SI), calculated from the diversity of the emotional content expressed in texts produced by different communities. The emotional valences of the words used by a community are extracted from Twitter data produced by that community, so the SI is built on Twitter data and lexicon-based sentiment analysis. The dataset also includes other possible diversity measures calculated from the same data, such as the number of tweets in the community language, the Type-Token Ratio, and the number of languages in a community. Note: the SI ranges in [0, 1]. A value of 0 means that the computed valences are very close to those of a standard emotional lexicon. A value of 0.5 indicates no correlation between the emotional content of words used by the community on Twitter and the standard emotional content. A value of 1 would correspond to the use of terms with the opposite emotional content compared to the standard. Data are computed at three geographical scales based on the Classification of Territorial Units for Statistics (NUTS), i.e., NUTS1, NUTS2, and NUTS3, for two countries, Italy and the United Kingdom.

Resource Category: Data: Tabular Data

Waiver

Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation above, generated by the Dataverse.

No waiver has been selected for this data project.

Metadata are made available under the Creative Commons Attribution 4.0 International Public License.

Confidentiality Declaration

Not available

Restrictions

Not applied

Citation Requirements

Cite the data as in the following example: Pollacci, Laura; Sirbu, Alina, 2024, "The Superdiversity Index", https://doi.org/10.17903/FK2/AVI6AH, SoDaNet Data Catalogue, version 1.

Depositor Requirements

The user must comply with the terms of the license and availability of data and metadata.

Conditions

For terms of use please see here

Disclaimer

For terms of use please see here

Restricted Files + Terms of Access

Restricted Files

There are 0 restricted files in this data project.

Data Access Place

The data are available through the SoDaNet Data Catalogue and Zenodo.

Availability Status

Files are available

Data Project Completion

Data project is complete

Guestbook

No guestbook is assigned to this data project, you will not be prompted to provide any information on file download.
