Appendix A: Difficulties Involved in Benchmark Construction