Chapter 18 Appendix A: Responsible AI Checklist
As we conclude this book, it is crucial to remember that technical skills are only half of the equation. Data science has real-world consequences. Before deploying any model, analysis, or reliable pipeline to production, use this checklist to ensure your work is robust, fair, and transparent.
This checklist is designed to be actionable for R users, pointing to specific packages and practices where applicable.
18.1 Data Quality & Lineage
“Garbage in, garbage out” applies to ethics as well as accuracy.*
-
- Tip: Use packages like
introdator custom scripts to scan for patterns resembling PII (emails, SSNs) before data leaves your secure environment.
- Tip: Use packages like
-
- Action: Check distribution of key demographics in your train vs. production sets.
-
- Tool: Use the
pointblankorvalidatorpackages to define and enforce data quality rules (e.g.,col_vals_between(age, 0, 120)).
- Tool: Use the
18.2 Fairness & Bias
Algorithms can reinforce existing inequalities.
-
- Tool: Use
fairness,fairmodels, ordalexto calculate metrics like Disparate Impact or Equal Opportunity difference. - Example Code:
fairness_check(explainer, protected = data$gender, privileged = "Male")
- Tool: Use
18.3 Transparency & Explainability
Black boxes should not make high-stakes decisions.
-
- Tool: Use
dalex,lime, orimlto create feature contribution plots or breakdown plots.
- Tool: Use