Canadian Computing Competition: 2011 Stage 1, Senior #1
You would like to do some experiments in natural language processing. Natural language processing (NLP) involves using machines to recognize human languages.
Your first idea is to write a program that can distinguish English text from French text.
After some analysis, you have concluded that a very reasonable way of distinguishing these two languages is to compare the occurrences of the letters t
and T
to the occurrences of the letters s
and S
. Specifically:
- if the given text has more
t
andT
characters thans
andS
characters, we will say that it is (probably) English text; - if the given text has more
s
andS
characters thant
andT
characters, we will say that it is (probably) French text; - if the number of
t
andT
characters is the same as the number ofs
andS
characters, we will say that it is (probably) French text.
Input Specification
The input will contain the number
followed by
lines of text, where each line has at least one character and no more than
characters.
Output Specification
Your output will be one line. This line will either consist of the word English (indicating the text is probably English) or French (indicating the text is probably French).
Sample Input 1
3
The red cat sat on the mat.
Why are you so sad cat?
Don't ask that.
Output for Sample Input 1
English
Sample Input 2
3
Lorsque j'avais six ans j'ai vu, une fois,
une magnifique image,
dans un livre
Output for Sample Input 2
French
(Note: Sample Input 2 is the first sentence of Le Petit Prince by Antoine de Saint-Exupéry.)
Sample Input 3
4
Si je discernais ta voix encore
Connaissant ce coeur qui doute,
Tu me dirais de tirer un trait
Quoi que partir me coute.
Output for Sample Input 3
English
(Note: Sample Input 3 is added by DMOJ from Le Fantôme de l'Opéra.)
Comments
This comment is hidden due to too much negative feedback. Show it anyway.
consider this sample:
your code would output 'English', even though there are more 's' characters in total. (also, don't post code in comments)
This comment is hidden due to too much negative feedback. Show it anyway.
noo volcano dont quit youre very smart and helpful
I'm really at a loss here. I've run my code through every sample I can think of and it always gives the correct answer, but when I try to submit it, I'm told it gives a wrong answer...
that is happening to me too
I can't get past test case 8. Any ideas to help a new guy out?
The problem statement clearly says to check for uppercase and lowercase characters.
Also, here's a tip: Instead of using a for loop, you can utilize Python's inbuilt count function.
Good suggest! Thank you so much! You help me a lot!
happy to help :)
I thought I handled the uppercase/lowercase issue using the .lower(). I'll give the .count() a shot and see what happens, thanks.
You aren't reading all the input. There will be N+1 lines of input (N lines of actual text). You only read a single line of input.
You should use the DMOJ Discord server to get help faster.
I submit my code, and it can not pass through all the test? What is the problem of my code?
Check the input specification. You need to be reading N lines of text
I don't understand the input in this problem. For Python do we just have the one input N? for example:(n = int(input). Or do we have to somehow code to receive an unknown amount of input?
So say the first input is n, then you have to make the code input n number of times before it ends. In example one, the first value was 3 so the code only allowed to input 3 other lines of codes.
Make sure you remember to check for capital letters as well!
im lost, my thing reads all three lines and is outputting correctly. what am i missing?
Your code only looks at the last line of text. Try a test case with two lines of English and one line of French.
OH, ok thanks! I was testing if the first one was messed up only!
This comment is hidden due to too much negative feedback. Show it anyway.
This comment is hidden due to too much negative feedback. Show it anyway.
Ok, but for what is the integer? it does not make any sense. im getting a wrong answer just on one test, if u could take a look at my code and say in what im messing it, i would be grateful. Thanks bro.
The number represents the number of lines of text. Your code fails because it only looks at the first one.
how do I keep getting test case two wrong i've tested my code out on tons of stuff and it seems functional to me anybody know why?
nvm I had a uppercase letter where I shouldnt have
Does anyone know how to solve time limit issues with java in case 4?
There's some repeated code in your submission that can be condensed. Also, consider writing a more efficient solution.
Ok, thank you!
The question states:
However, in test case 3, I output the number of t's and s's and apparently there is 0 of each of them. But the solution is "English". I had to hardcode it to solve the problem. Please do something about it.
Edit: or was there a problem with my code?
You are forgetting that C/C++ style input stops with all whitespace characters, so doing this:
will read but a single word from the first line of the input. What you should be doing is reading the entire line:
I already AC'd by hard-coding, but why does this solution using getline WA case 3?
You should insert
cin.ignore()
between lines 9 and 10 to ignore the new line character which has not been read bycin >> N
.Worked, thank you!
Can anyone tell me what's wrong with my test case 3?
Your submission is testing each line for its language individually and then choosing the language that appeared most often in all the lines. But that's not what the question asks for.
Thx, I couldn't figure out what was wrong in my code. English is not my mother language so can't pick up subtle things. I thought that "given text" was referring to each input text line. With your comment I realize it was referring to the full text.
I'm not sure what's wrong with my code, I failed test case 3. Can someone help?
I don't see anything wrong with your actual code. I believe that the problem is that right after you have scanner.nextInt(), when you use scanner.nextLine() it actually reads the same line of the integer, meaning it would read nothing. So just use an extra scanner.nextLine() in between your reading of "a" and your lines, and don't use it. That hopefully should fix the problem.
It worked thank you so much!
I speak french, and t actually works at guessing the language! Cool problem!
This comment is hidden due to too much negative feedback. Show it anyway.
Your program only counts then individually
i do not just count but i've compared the caracters between them too?
I dont see how i could get test case 3 wrong. Its just a simple comparison...
Try rereading the problem statement, especially the classification criteria.
The problem was that i had an int scanner before a String
To be more specific,
Scanner#nextInt
doesn't go to the new line of the input. In order to force it onto the next line, you need to append aScanner#nextLine
to it.This comment is hidden due to too much negative feedback. Show it anyway.
Read the statement again (about equal).
This comment is hidden due to too much negative feedback. Show it anyway.
Either use
time.
StringBuilder
or process each line as you read it. Concatenating all the lines takesThis comment is hidden due to too much negative feedback. Show it anyway.