Evaluate, pick, and configure checking/linting tools for the Getting Involved Guide [#3072788]

Problem/Motivation

In developing a new and improved Getting Involved Guide, we want to make sure that we use clear, inclusive, accessible language. On #3069010: Develop writing style guidelines for the Getting Involved Guide, we discussed a style guide. This issue is about evaluating and picking tools to help us enforce additional things:

Accessibility -- in the body part of each page there should not be accessibility problems
Grammar/spelling
Readability -- exact grade level etc. to be determined

These automated tools for checking style may be useful. Note that "linter" in this context means "software that checks for problems":

https://errata-ai.github.io/vale/ -- Vale: linter that checks based on your style guidelines
https://textlint.github.io/ -- textlint -- rules-based engine for automatic text checking (you can choose which rules to use)
https://alexjs.com -- Alex: checks for bias (gender, race, etc.) in writing
Word processors, such as Microsoft Word, the online editor for Google Docs, and Open/Libre Office Writer (useful mostly for grammar/punctuation/spelling checks if you copy/paste text from a drupal.org page into them), or Hemingway app
Command-line tools like grep, if you copy/paste text from a drupal.org page into a plain text file. For instance, a shell script could be created that would download a particular URL, strip out the HTML (probably?), and verify that certain words are not present in the page.
Accessibility testing tools -- see https://www.drupal.org/docs/8/accessibility/how-to-do-an-accessibility-r...

Proposed resolution

1. Evaluate the tools (the ones listed above, and others we come across).
2. Figure out which tools will be useful in enforcing our style guidelines.
3. Figure out how to configure the tools we are using.

Remaining tasks

TBD.

Comments

Comment #1

5 August 2019 at 20:08

jhodgdon created an issue. See original summary.

Comment #2

jhodgdon

she/her

English

CreditAttribution: jhodgdon commented 5 August 2019 at 20:11

Parent issue:

#3069010: Develop writing style guidelines for the Getting Involved Guide

Comment #3

mdstamm CreditAttribution: mdstamm as a volunteer commented 8 August 2019 at 23:24

Created a Google sheet to collect feedback on tool testing based on conversations in Slack and in previous ticket Develop writing style guidelines for the Getting Involved Guide

Anyone who would like to test a tool and report your findings, please request permission on this document, and I will add you as an editor. Content checking tool analysis

Some tools are mentioned above, but if you have other tools in mind, please feel free to add. You may also test a tool that someone else has already tested; it would be great to have feedback from different types of users (i.e., "personas"). For example, I tested Vale, but I am not a developer or comfortable with command line, so my perspective will be different than that of a developer/someone who is comfortable in command line.

Comment #4

mradcliffe

he/him

English

CreditAttribution: mradcliffe at Kosada commented 10 August 2019 at 17:08

Thoughts so far:

Vale has a smaller install footprint due to being golang. But it does not have as nice of editor integration that is available from the JavaScript tools such as textlint or alex. With that integration, then a user wouldn't need to use the CLI, but could use their editor of choice.

There is a vscode extension usable with vale.

Rules are easier to write in vale than textlint or alex, which require JavaScript knowledge.

textlint seems to be the most automatable, and it might take less time to implement an automated test that polls drupal.org's API by node and runs the linter on pages. I'm not sure if that's desirable, but it could be.

Comment #5

lizzjoy

she/her

English

Ann Arbor

CreditAttribution: lizzjoy at Drupal Association commented 10 August 2019 at 17:47

Issue summary:

View changes

Added Hemingway app as a resource. It works well for grammar and readability checks.

Comment #6

lizzjoy

she/her

English

Ann Arbor

CreditAttribution: lizzjoy at Drupal Association commented 10 August 2019 at 17:48

Issue tags:

+mwds2019

Comment #7

mradcliffe

he/him

English

CreditAttribution: mradcliffe at Kosada commented 10 August 2019 at 18:16

It seems like textlint is much easier to integrate into a software js project so that all the rules can be installed from npm. This might make it difficult to integrate with editor software.

I wasn't able to get the vale vscode extension to do anything at all, but I may not have understood its documentation appropriately.

Edit: I figured out getting vale vscode to work, and it looks like it needs a local configuration file. Although npm may not be necessary to install rules it's still a directory and ini file that needs to be in the same directory. Again, this is editor configuration rather than automated or CLI access.

Comment #8

mradcliffe

he/him

English

CreditAttribution: mradcliffe at Kosada commented 10 August 2019 at 18:20

I tested in alexjs and vale, but gendered language used as a quote becomes a false positive lint error. This probably doesn't matter for documentation writing in these sections unless someone wants to write an inspirational quote.

Comment #9

jhodgdon

she/her

English

CreditAttribution: jhodgdon commented 10 August 2019 at 18:22

Can we get these results into the spreadsheet?

Comment #10

mradcliffe

he/him

English

CreditAttribution: mradcliffe at Kosada commented 11 August 2019 at 15:29

Posted in the spreadsheet. Thank you, @mdstamm.

For textlint I used the following package.json, but I didn't try the many many node modules that are listed on the github page:

{
  "name": "blah",
  "version": "1.0.0",
  "description": "",
  "main": "index.js",
  "scripts": {
    "test": "echo \"Error: no test specified\" && exit 1"
  },
  "author": "",
  "license": "GPLv2",
  "devDependencies": {
    "@textlint-rule/textlint-rule-no-unmatched-pair": "^1.0.7",
    "textlint": "^11.3.1",
    "textlint-rule-alex": "^1.3.1",
    "textlint-rule-diacritics": "^1.0.0",
    "textlint-rule-en-capitalization": "^2.0.2",
    "textlint-rule-spelling": "^0.1.1",
    "textlint-rule-unexpanded-acronym": "^1.2.3"
  }
}

Comment #11

jhodgdon

she/her

English

CreditAttribution: jhodgdon commented 13 August 2019 at 17:00

Title:	Evaluate and pick checking/linting tools for the Getting Involved Guide	» Evaluate, pick, and configure checking/linting tools for the Getting Involved Guide
Issue summary:	View changes

Adding a note that we will need to decide on tools and also configuration of the tools.

Comment #12

eojthebrave

he/him

English

Minneapolis, MN

CreditAttribution: eojthebrave at Drupalize.Me commented 20 August 2019 at 02:57

One thing that I've experienced using automated linting tools like these for copy editing is that it can be helpful if you can configure rules to throw an error vs. a warning. For example; our style guide might say "Drupal vs. drupal". and if you ever write "drupal" it's an error. And is required to be fixed. But, a lot of the rules are bound to be subjective, and depend on specific context. For example, a tool linting for gendered language is going to flag words like "her", or "his". But, there might be legitimate reason to use them. These should show as a warning to the user, but not an error.

Especially if we're talking about doing any kind of automation where we would likely want the tool-chain disallow something that doesn't pass all the tests.

In my limited experience vale is the best for this as each rule can be configured to report as a suggestion, warning, or error.

Additionally we should consider whether the tool can process asciidoc content as it might be useful to also use the same tool-chain for the user guide and other "official" docs which are currently in asciidoc format.