[Bug 247397] New: w3m is our default HTML text dump tool and our system defaults to UTF-8, but w3m defaults to printing entities as ASCII equivalents