Your training corpus is empty.
Post by Namrata Hadimani
Hi All,
I am trying to train a language model with the command below:
~/mosesdecoder/bin/lmplz -o 3 <~/corpus/news-commentary-v8.fr-en.true.en > news-commentary-v8.fr-en.arpa.en
http://www.statmt.org/moses/?n=Moses.Baseline
=== 1/5 Counting and sorting n-grams ===
Reading /home/namrata/smt/corpus/news-commentary-v8.fr-en.true.en
----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
Unigram tokens 0 types 3
=== 2/5 Calculating and sorting adjusted counts ===
Chain sizes: 1:36 2:4734547456 3:8877277184
terminate called after throwing an instance of
'lm::builder::BadDiscountException'
what(): /home/namrata/smt/mosesdecoder/lm/builder/adjust_counts.cc:53
in void
lm::builder::{anonymous}::StatCollector::CalculateDiscounts(const
lm::builder::DiscountConfig&) threw BadDiscountException because `s.n[j]
== 0'.
Could not calculate Kneser-Ney discounts for 1-grams with adjusted count
2 because we didn't observe any 1-grams with adjusted count 1; Is this
small or artificial data?
Try deduplicating the input. To override this error for e.g. a
class-based model, rerun with --discount_fallback
So I changed my command to the following:
~/mosesdecoder/bin/lmplz -o 3 <~/corpus/news-commentary-v8.fr-en.true.en > news-commentary-v8.fr-en.arpa.en --discount_fallback
After this, when I run the command:
~/mosesdecoder/bin/build_binary \
news-commentary-v8.fr-en.arpa.en \
news-commentary-v8.fr-en.blm.en
I get the following error:
lm/vocab.cc:324 in void
lm::ngram::MissingSentenceMarker(const lm::ngram::Config&, const char*)
threw SpecialWordMissingException.
The ARPA file is missing </s> and the model is configured to reject
these models. Run build_binary -s to disable this check. Byte: 66
ERROR
Could anyone help me out with this, please?
Regards,
Namrata Hadimani
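The line "Unigram tokens 0 types 3" in the log above suggests that lmplz read no tokens at all from the input, which matches the thread subject. A minimal sketch of a sanity check to run before lmplz, assuming a POSIX shell; the function name corpus_ok is hypothetical and not part of Moses or KenLM:

```shell
#!/bin/sh
# Hypothetical helper (not part of Moses/KenLM): succeeds only if the
# corpus file exists and has a size greater than zero.
corpus_ok() {
    corpus=$1
    if [ ! -s "$corpus" ]; then
        echo "corpus is missing or empty: $corpus" >&2
        return 1
    fi
    # Print line and word counts so an unexpectedly small file is easy to spot.
    wc -lw "$corpus"
    return 0
}

# Example use with the paths from the post above (left commented out so the
# snippet runs without Moses installed):
# corpus_ok ~/corpus/news-commentary-v8.fr-en.true.en &&
#     ~/mosesdecoder/bin/lmplz -o 3 <~/corpus/news-commentary-v8.fr-en.true.en >news-commentary-v8.fr-en.arpa.en
```

If the check fails, the earlier preprocessing step that was supposed to write the .true.en file is the place to look, rather than overriding the lmplz and build_binary checks.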
On Thu, 22 Apr 2021 at 18:32, Namrata Hadimani wrote:
Hi Hieu,
Thanks for the help. I was able to compile the Moses toolkit successfully.
Regards,
Namrata Hadimani
I've just successfully compiled moses-4.0 on Ubuntu 20.04 with Boost 1.71.
There's something wrong with your Boost installation.
There is at least 60 GB of free space, so I don't think it is a
disk space issue. Has moses-4.0 been tested for compilation with Boost
1.71? Do you recommend using Boost 1.64 (as in the example in the
Moses manual)?
Thank you
On Thu, Apr 22, 2021 at 1:15 PM Hieu Hoang wrote:
There seems to be a problem with the Boost library. Is the
disk full? Perhaps you should re-install Boost.
Post by ram anirudh cherukupally
Dear Moses-Support,
Please find the build.log.gz attached as per the
instructions when the build failed.
OS: Ubuntu 20.04
RAM: 8 GB
libboost-dev version: 1.71.0.0ubuntu2
Command used for compiling moses: ./bjam -j4
Thanks and Regards
--
CH Ram Anirudh
_______________________________________________
Moses-support mailing list
http://mailman.mit.edu/mailman/listinfo/moses-support
--
Hieu Hoang
http://statmt.org/hieu