Abstract: The need for a balanced, representative national scale corpus has been skyrocketing for the already `low resource' tagged language-Bangla. Many sporadic empirical works have been done so far ...