25 August 2022: Clinical Research  

Cross-Cultural Adaptation of the Godin-Shephard Leisure-Time Physical Activity Questionnaire for Arabic Population and Testing its Psychometric Properties

Faisal Asiri1ABEFG, Jaya Shanker Tedla1ABCDEF*, Ravi Shankar Reddy1ABCEF, Mastour Saeed Alshahrani1ABEFG, Adel Alshahrani2ABEF, Devika Rani Sangadala1ABCDEF, Kumar Gular1ABEF, Venkata Nagaraj Kakaraparthi1ABEF, Snehil Dixit1ABEF

DOI: 10.12659/MSM.937245

Med Sci Monit 2022; 28:e937245



BACKGROUND: Physical activity during leisure time is essential to promote health, owing to the decreased physical activity in mechanized working environments. The present study aimed to cross-culturally adapt the Godin Shephard Leisure-Time Physical Activity Questionnaire (GSLTPAQ) into Arabic and to assess its psychometric properties.

MATERIAL AND METHODS: We conducted this study in various standardized stages. At each stage, the corrections were made by an expert committee. In the initial stage, the English version of the GSLTPAQ was translated into Arabic and then back-translated into English. In the second stage, we ensured the content validity by collecting the opinion of 10 professionals in the medical field. Finally, in the third stage, the Arabic version was applied to the Saudi population to check its test-retest reliability, face validity, internal consistency, and concurrent validity.

RESULTS: For the Arabic version of the GSLTPAQ, content validity was evaluated by 10 experts and found to be excellent. The scale was applied to 150 office workers in the university to assess psychometric properties. The scale showed remarkable internal consistency (Cronbach’s alpha 0.99) and high test-retest reliability (intraclass correlation coefficient 0.88). We evaluated the concurrent validity by comparing it with the Copenhagen City Heart Study Leisure Time Physical Activity Questionnaire, and it showed excellent validity, with a correlation of 0.86 (P<0.001).

CONCLUSIONS: Following a careful translation process, we created a cross-culturally adapted Arabic version of the GSLTPAQ. It was found to have excellent content validity, test-retest reliability, internal consistency, and concurrent validity.

Keywords: Medicine, Arabic, Sedentary Behavior, Surveys and Questionnaires


Background

The mechanization of occupations and the COVID-19 pandemic have decreased physical activity levels at work and contributed to various health-related issues due to physical inactivity [1]. People spend most of the day in their occupation, and a lack of physical activity at the workplace promotes negative health-related issues among workers. This is known as a pandemic of physical inactivity [2]. Current ergonomic practices are designed to ensure enough physical activity to create good working conditions. However, people now use their leisure time to engage in physical activity and to enhance their health [3].

The Godin Shephard Leisure-Time Physical Activity Questionnaire (GSLTPAQ) is a standardized measure of physical activity levels during leisure time [4]. It is a short and easily applicable scale for use with the general population, and the examiner applying it does not require formal medical training. The GSLTPAQ has been applied widely in various patient populations in oncology [5] and neurology [6]. It has also been validated and used in children and adolescents [7,8]. The GSLTPAQ has been extensively validated against maximal oxygen uptake (VO2 max), the maximum amount of oxygen the body can use during exercise, for classifying people into 2 categories: insufficiently active and active [9]. The scale has been translated into Portuguese and validated in Brazil [5,10], and the components of the scale have even been translated into the Kinyarwanda language for use in Rwanda [11].

The GSLTPAQ measures 3 components: strenuous exercise (eg, running), moderate exercise (eg, fast walking), and mild exercise (eg, yoga). The questionnaire describes these 3 components and gives examples of activities under each. We asked the participants to read each component carefully and write the number of times they did that type of exercise per week. The GSLTPAQ leisure score index was calculated from these counts by multiplying each count by a constant for that type of activity: 9 for strenuous exercise, 5 for moderate exercise, and 3 for mild exercise. For example, if a participant reported 2 sessions of strenuous exercise, 3 sessions of moderate exercise, and 4 sessions of mild exercise, the weighted scores were 2×9=18, 3×5=15, and 4×3=12, respectively. We added these 3 scores to calculate the GSLTPAQ leisure score index: 18+15+12=45. Participants with a leisure score index of 24 and above were considered active, those with scores of 14 to 23 were considered moderately active, and those with scores below 14 were considered insufficiently active [4].
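The scoring rule described above can be sketched in a few lines of Python. This is a minimal illustration only: the function names are hypothetical and not part of the questionnaire, and the weights and cut-offs are those stated in the text.

```python
# Weights per weekly session, as stated in the text.
WEIGHTS = {"strenuous": 9, "moderate": 5, "mild": 3}


def leisure_score_index(strenuous: int, moderate: int, mild: int) -> int:
    """Return the GSLTPAQ leisure score index from weekly session counts."""
    if min(strenuous, moderate, mild) < 0:
        raise ValueError("session counts cannot be negative")
    return (
        WEIGHTS["strenuous"] * strenuous
        + WEIGHTS["moderate"] * moderate
        + WEIGHTS["mild"] * mild
    )


def activity_category(score: int) -> str:
    """Classify a leisure score index using the cut-offs given in the text."""
    if score >= 24:
        return "active"
    if score >= 14:
        return "moderately active"
    return "insufficiently active"


# Worked example from the text: 2 strenuous, 3 moderate, and 4 mild sessions.
example_score = leisure_score_index(2, 3, 4)
print(example_score, activity_category(example_score))
```

For the worked example in the text, the index is 18+15+12=45, which falls in the active category.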

Approximately 400 million people worldwide have Arabic as their primary language, and the typical Arabic-speaking person’s capacity to read and communicate in English is limited. Physical inactivity is a common issue among the Arabic population, leading to many health-related problems [12]. Converting commonly used leisure-time physical activity questionnaires into Arabic will help health care professionals and individuals to assess physical activity levels. Owing to globalization, the Arabic population is engaged in the modern job market, which has resulted in their focusing less on cultural values. Without questionnaires such as the GSLTPAQ in Arabic, it is difficult for management and organizations to comprehend the level of leisure-time physical activity among their employees and to take the necessary actions related to this issue. Our primary objective in this study was to cross-culturally adapt the GSLTPAQ to the Arabic population and to test the psychometric properties, such as reliability and validity, of the Arabic version.

Material and Methods


We followed the guidelines recommended by Beaton et al [13] to translate the scale and adapt it cross-culturally, in the following 6 stages.


The study included 2 bilingual translators, whom we selected for their native-level proficiency in Arabic and expertise in English. The first translator was a physical therapy professor, and the second was a native Arabic language expert without any medical background. We described the study’s aim to the translators and asked them to contribute to the adaptation process. We gave them sufficient time to fully comprehend the scale and then to produce a translated version.


After completing the translation process, the translators notified the authors. We then met with the translators to combine their translation results. We named the translation done by the language specialist “T1” and the translation done by the medical specialist “T2”. After the team’s discussion, the authors and translators formed a single Arabic version of the GSLTPAQ, based on T1 and T2.


In the back-translation stage, in addition to the initial translators, we selected 2 new translators who were bilingual in English and Arabic. Neither of these translators had any medical education, and both were native English speakers. We communicated with these translators separately, and both performed their translations independently. We called their translations “BT1” and “BT2”.


In this step, we formed a panel consisting of authors, co-workers, and the 4 translators and discussed the translation and back-translation process. The panel discussed the findings of translations. Finally, the authors formulated a pre-final version of the Arabic version of the GSLTPAQ, with the committee members’ consensus for the testing process.


In this stage, we tested the pre-final Arabic version of the GSLTPAQ in 50 participants, who were office workers at our university recruited by convenience sampling. We ensured that all the participants could speak both Arabic and English. First, all participants were provided with the Arabic GSLTPAQ forms and were asked to fill them out. Later, we provided the original English version of the GSLTPAQ to the participants for comparison. All participants answered the questions in both forms, and their responses did not differ between the two. They all said there were no differences between the original English and Arabic versions of the questionnaire. Two evaluators assessed this information; one was a language expert, and the other was a therapist. They interviewed each participant about the questions, and the answers were discussed with the committee. Data collected from these participants were not used for the psychometric properties analysis of the Arabic version of the GSLTPAQ.


After the Arabic version of the GSLTPAQ was constructed, the expert committee approved the final version of the survey. We then sent the GSLTPAQ tool to the committee, together with the documentation of all the stages followed to formulate the Arabic version.


We sent the final approved version of the GSLTPAQ Arabic form to 10 bilingual physical therapists with doctorate degrees; they had at least 10 years of teaching and clinical experience. We provided all 10 professionals with the validity form, a consent form for their participation, the original English GSLTPAQ, and the Arabic version of GSLTPAQ to perform a validity assessment. The format for this assessment consisted of a 5-point Likert scale ranging from 1 (not acceptable) to 5 (excellent), through which the professionals rated all the items in the GSLTPAQ. For the validation process, the evaluators were provided with sufficient time to understand, analyze, and rate the form; we used the answers from the evaluators for the content validity assessment.


Using convenience sampling, we included 150 participants of both sexes, aged between 30 and 70 years, who were office workers and could read, write, and speak Arabic. We calculated the sample size by using the ClinCalc.com sample size calculator software. Based on a previous study’s mean values of GSLTPAQ scores, we used the anticipated mean, an alpha value of 0.05, and power of 80%, and obtained a sample size of 150. Participants with diabetes, hypertension, psychological conditions, and neurological conditions, and those who were uncooperative or illiterate were excluded from the study. We performed the study at the Department of Medical Rehabilitation, College of Applied Medical Sciences of the university, and the study duration was 1 year.

PROCEDURE FOR INTERNAL CONSISTENCY, TEST-RETEST RELIABILITY, AND CONCURRENT VALIDITY: We obtained written informed consent from the participants who were willing to participate in the study and met the inclusion criteria. The evaluator provided the Arabic version of the GSLTPAQ to all the participants, who were able to read, speak, and write in Arabic. The evaluator instructed the participants to read the scale carefully and then to record the number of sessions they did of strenuous, moderate, and mild exercise in a week (7 days). The authors used the data obtained through the surveys to statistically analyze test-retest reliability and internal consistency. The evaluator allowed a 10-min break, and, afterward, all the participants filled out the Copenhagen City Heart Study Leisure Time Physical Activity Questionnaire [14] (CCHSLTPAQ) to assess concurrent validity. The same evaluator was present for all the participants during the administration of the GSLTPAQ and CCHSLTPAQ.

Within 1 week, the Arabic version of the GSLTPAQ was re-administered to all 150 participants, who filled out the scale based on the same evaluator’s instructions. No participants were shown the initial form they had filled out 1 week prior, so that their retest responses were not influenced by their earlier answers.


We used SPSS statistics version 24.0 (IBM Corp, Armonk, NY, USA) for data analysis. The average (mean) and standard deviation of the demographic characteristics and the first and second ratings of the Arabic version of GSLTPAQ scores were analyzed via univariate analysis using descriptive statistics. The authors used the Shapiro-Wilk test to assess the normality of the variables. We used the content validity index to assess the content validity of the Arabic version of the GSLTPAQ. The method of the content validity index calculation proposed by Yusoff was used in this study. As per Yusoff’s article, any values greater than 0.83 were acceptable to say that the questionnaire had content validity. The formula proposed for the scale-level content validity index was the sum of item-level content validity index scores divided by the number of items [15]. To evaluate the test-retest reliability, we used the intraclass correlation coefficient (ICC). Koo et al suggested recommendations for reporting the ICC while performing reliability research. They proposed that ICC values <0.50 denote poor reliability, values between 0.50 and 0.75 denote moderate reliability, values between 0.75 and 0.90 denote good reliability, and values >0.90 denote excellent reliability [16]. The internal consistency of the Arabic version of GSLTPAQ was analyzed using Cronbach’s alpha, and acceptable values of alpha ranged from 0.70 to 0.95 [17]. We used the Pearson correlation coefficient to evaluate the concurrent validity of the Arabic edition of the GSLTPAQ compared with the scores of the CCHSLTPAQ. The rule for interpreting the Pearson correlation coefficient was as follows: 0.90 to 1.00 denoted very high correlation, 0.70 to 0.90 denoted high positive correlation, 0.50 to 0.70 denoted moderate correlation, 0.30 to 0.50 denoted low positive correlation, and 0.00 to 0.30 denoted negligible correlation [18].
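The scale-level content validity index formula and the interpretation bands described above can be illustrated with a short Python sketch. It is not the analysis performed in SPSS. The function names and the item-level values in the usage example are hypothetical, and the handling of values that fall exactly on a band boundary (for example, 0.75 or 0.90) is an assumption, because the stated ranges overlap at their edges.

```python
def scale_cvi_average(item_cvis):
    """Scale-level CVI: sum of item-level CVI scores divided by the number of items."""
    if not item_cvis:
        raise ValueError("at least one item-level CVI is required")
    return sum(item_cvis) / len(item_cvis)


def has_content_validity(scale_cvi):
    """Acceptance rule quoted in the text: values greater than 0.83."""
    return scale_cvi > 0.83


def interpret_icc(icc):
    """ICC bands quoted in the text; boundary handling is an assumption."""
    if icc < 0.50:
        return "poor"
    if icc < 0.75:
        return "moderate"
    if icc <= 0.90:
        return "good"
    return "excellent"


def interpret_pearson(r):
    """Pearson bands quoted in the text, applied to the magnitude of r."""
    size = abs(r)
    if size >= 0.90:
        return "very high"
    if size >= 0.70:
        return "high"
    if size >= 0.50:
        return "moderate"
    if size >= 0.30:
        return "low"
    return "negligible"


# Hypothetical item-level CVI scores for illustration only (not study data).
hypothetical_item_cvis = [1.0, 0.9, 1.0]
print(scale_cvi_average(hypothetical_item_cvis))
print(interpret_icc(0.88), interpret_pearson(0.86))
```

Under these assumptions, an ICC of 0.88 falls in the good band and a Pearson coefficient of 0.86 falls in the high band.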



Results

During the translation of the English version of the GSLTPAQ, the Arabic translators experienced difficulty translating a few words, such as skiing and skating; they could not find exact corresponding Arabic words initially, but after discussing with experts, they were able to obtain the exact words. Similarly, the word “rapidly” was replaced with the word “fast”, so that the phrase became “like a fast heartbeat” rather than “heart beats rapidly”. Some questions were difficult to understand when translated directly from English to Arabic; for easier understanding, we used different words with the same general meaning in particular questions. For example, the phrasing “mild exercise” in the original English version of section 3 became “light exercise” in the Arabic version of the GSLTPAQ. We have included the new, completed Arabic version of the GSLTPAQ in Appendix 1.


A total of 150 office workers in the university participated in the study, of whom 78 were men and 72 were women. Table 1 shows the means and standard deviations of age, height, weight, body mass index, and first and second scores of the GSLTPAQ and CCHSLTPAQ.


The content validity of the GSLTPAQ was analyzed using the content validity index. The content validity assessment for the strenuous, moderate, and light physical activity components and the total GSLTPAQ scores revealed excellent results. Test-retest reliability was assessed in the 150 office workers using the ICC, which demonstrated high reliability, with a value of 0.88 for the total GSLTPAQ score. Cronbach’s alpha showed an internal consistency of 0.99 for the total GSLTPAQ scores. Pearson correlation demonstrated a strong correlation between the leisure-time physical activity scores of the GSLTPAQ and CCHSLTPAQ, with an r value of 0.86 (P<0.001). Table 2 shows the content validity, internal consistency, test-retest reliability, and concurrent validity scores for the GSLTPAQ.


Discussion

The GSLTPAQ is a simple, validated scale for assessing leisure-time physical activity. This is the first study to adapt the scale for an Arabic-speaking population in Saudi Arabia, and the adapted version showed satisfactory psychometric properties. Many researchers have validated the GSLTPAQ to classify healthy and diseased populations into active and insufficiently active categories [9,14,15].

The adapted Arabic version of the GSLTPAQ had a content validity index of 0.95 and a test-retest reliability (ICC) of 0.88. Similar results were found by Sao Joao et al in their cross-cultural adaptation of the GSLTPAQ into Portuguese. They followed the process proposed by Beaton et al, assessed content validity with the content validity index, and assessed test-retest reliability with the ICC. They successfully translated the questionnaire, with an excellent content validity index greater than 0.9 and an excellent test-retest reliability value of 0.84 [21]. Furthermore, the Turkish version of the GSLTPAQ was assessed for psychometric properties in a population with diabetes; 300 Turkish patients with diabetes were included to evaluate reliability and validity. The content validity index showed good content validity of 0.82, and test-retest reliability was excellent, with an r value of 0.97 [22].

In the present study, to assess concurrent validity, we compared the Arabic version of the GSLTPAQ with the self-reported scores of the CCHSLTPAQ and obtained an excellent r value of 0.86. Sao Joao et al analyzed the construct validity of the Brazilian version of the GSLTPAQ by comparing it with the Baecke habitual physical activity questionnaire components and a self-reported measure of walking behaviors and found r values of 0.02 to 0.62. These differences in validity may have been due to the differences in the types of scale chosen in the studies. The CCHSLTPAQ is very similar to the GSLTPAQ, and both were created to assess physical activity during leisure time. This may explain why we obtained a higher validity than the previous studies [10].

Even though the sample size in the present study was appropriate, we performed the psychometric property assessment only in healthy participants; the scale should be evaluated further in patient populations. Future studies can also assess validity against more sophisticated outcome measures, such as VO2 max. The GSLTPAQ is an essential tool for understanding people’s leisure-time physical activity. Finally, due to the scale’s simplicity and the minimal time required to complete the assessment, the GSLTPAQ can be adapted to many other cultural backgrounds.


Conclusions

The GSLTPAQ was successfully translated and adapted for the Arabic population. The scale had an excellent content validity of 0.95, which was assessed by 10 experts using the content validity index. The internal consistency evaluated by Cronbach’s alpha was very high, with an alpha value of 0.99. The ICC was used to calculate the test-retest reliability and showed high reliability, with a value of 0.88. Finally, the concurrent validity was assessed by comparing the scale with the CCHSLTPAQ, and it showed very good validity, with a Pearson r value of 0.86. Overall, the Arabic version of the GSLTPAQ was shown to have excellent psychometric properties.


