The original CountVectorizer is badly implemented: it does not conform to the ML pipeline interface, uses caching
in a way that leaks, etc. Here we fix some of the problems, but more remains to be done.
TODO: Get rid of Vocabulary in the constructor, adapt to the ModelWithSummary interface.
Linear Supertypes
Params, Serializable, Serializable, Identifiable, AnyRef, Any
Created by eugeny.malyutin on 06.05.16.