Citation:
J. Lin and P. Zhang, “Deconstructing nuggets: the stability and reliability of complex question answering evaluation,” Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp. 327-334, 2007.