Ali Saqallah

Logo

Data analyst with a master's degree in digital engineering, equipped with a strong foundation in quantitative analysis, data cleaning, and visualization. Proficient in Python and statistical methods, experienced in extracting actionable insights from complex datasets.

View My GitHub Profile

Data Analyst

Technical Skills: Python, Java, MATLAB, Microsoft Power BI, Dashboards, Problem Solving, Data Cleansing, Data Visualization (DataViz), Data Analysis, SQL, Tableau, R, KPI, SAP, QGIS, Statistics, Excel, Spreadsheets, Data Warehousing, Statistical Data Analysis, Git

Education

Work Experience

Thesis Researcher (Data Analyst) @ Webis.de, Germany (January 2023 - August 2023) Conducted research on real-world events’ impact on Wikipedia, utilizing data analysis and Python programming.

Civil Structural Engineer @ Creative Urban Designs, Jordan (April 2018 - September 2020)

Projects

Thesis: Quantifying the Effects of Real-World Events on Wikipedia

Link

Objective: Quantitatively analyze the influence of real-world events on English Wikipedia editing behavior.

Subobjectives:

Development of Analysis Methodology:

Step 1: Acquiring Editing Data

Relevant editing data obtained from the reproduction of Kiesel et al.’s [2017] paper “Spatio-temporal Analysis of Reverted Wikipedia Edits,” using authors’ open-source software.

Step 2: Determining Time Frames around Events

a) 12th month period:

b) 8-week period:

Step 3: Operationalizing Key Metrics

a) Total Edits: Measure of overall editing activity and engagement.

b) Reverted Edits: Indication of conflicts or disagreements among editors.

c) Vandalism-Reverted Edits: Highlighting malicious attempts to manipulate content.

d) Top Articles and their Contributions: Identifying topics that attract the most attention.

e) Number of Protected Articles: Understanding Wikipedia editors’ response to events.

Step 4: Selecting Relevant Wikipedia Articles

Definition of Relevant Article:

Four Approaches Explored:

a) Main Article: Limits editing behavior comparison before and after the event (Main articles often lack pre-event editing data).

b) Main Category: Many unrelated articles; missing relevant ones.

c) Directly Related Category: Inconsistent, lacks direct connections.

d) Title-Based Search (chosen): Iterative process for relevance, ensures high relevance to the event, and consistent.

Step 5: Quantifying Effects with Statistics

Results:

Quantification of Event Impact:

1) Overall Editing Activity:

2) Reverted Edits:

3) Vandalism-Reverted Edits:

4) Article Protection and Vandalism:

5) Editorial Biases:

6) Registered Users Contributions:

These results provide a comprehensive understanding of how real-world events impact English Wikipedia editing behavior, including increased activity, differences in reverted edits, vandalism-reverted edits, article protection, editorial biases, and the contributions of registered users during various events.